Ehrlichia ewingii proteins, nucleic acids, and methods of their use

ABSTRACT

The novel omp-1 gene cluster encoding twenty one  Ehrlichia ewingii  (EE) proteins was isolated and sequenced completely. This invention relates to isolated  E. ewingii  (EE) polypeptides, isolated polynucleotides encoding EE polypeptides, probes, primers, isolated antibodies and methods of their production, immunogenic compositions and vaccines, as well as methods of using the EE polypeptides, antibodies, probes, and primers for the purpose of diagnosis, therapy and production of vaccines against  E. ewingii.

RELATED APPLICATIONS

This application claims priority to U.S. Provisional Application Ser. No. 60/916,227, filed May 4, 2007; and U.S. Provisional Application Ser. No. 61/016,348, filed Dec. 21, 2007; the entire contents of which are incorporated herein by reference.

GOVERNMENT RIGHTS

This invention was made, at least in part, with federal funding from the National Institutes of Health Grant R01AI47407. The United States Government may have certain rights in this invention.

BACKGROUND

Ehrlichia ewingii, a tick-transmitted rickettsia previously known only as a canine pathogen, is the most recently recognized human granulocytic ehrlichiosis agent. Granulocyte-tropic Ehrlichia was first reported by Dr. S. A. Ewing in 1971 in a dog from Arkansas and was thought to be a granulocytic variant of Ehrlichia canis. Granulocyte-tropic Ehrlichia was recognized as a separate species in 1992, based on 16S rRNA gene sequence comparison and named as Ehrlichia ewingii in honor of Dr. S. A. Ewing. Since then, canine infection with E. ewingii has been reported in several states in the U.S. and recently from Africa. Clinical signs in dogs infected with E. ewingii are fever, lethargy, anorexia, lameness, and polyarthritis, accompanied with mild thromobocytopenia and mild anemia. In 1999, human infection with E. ewingii was documented. Since 1996, retrospectively, approximately 10 confirmed cases of human granulocytic ehrlichiosis caused by E. ewingii infection have been identified in Missouri and Oklahoma.

Diagnosis of E. ewingii infections has proven difficult. E. ewingii has yet to be cultivated, and there is no serologic test available to diagnose E. ewingii infection. Clinical signs of patients infected with E. ewingii, such as fever, headache, myalgia, leukopenia, and thrombocytopenia are similar to those of human monocytic ehrlichiosis caused by E. chaffeensis and human granulocytic anaplasmosis caused by A. phagocytophilum. Hence, clinical features alone cannot distinguish these causative agents. Further complicating the diagnosis of ehrlichiosis infections, E. ewingii and E. chaffeensis also share the same vector tick species and animal reservoirs. Experimentally, the Lone star tick (Amblyomma americanum) has been shown to be a competent vector, although bacterial DNA has been detected in other species of ticks. White-tailed deer (Odocoileus virginianus) is considered to be an important reservoir for E. ewingii and dogs are also possible reservoirs. Consequently, E. ewingii and E. chaffeensis have similar seasonal and geographic distributions. While bacteria have been seen on blood smears from infected animals and humans, and detected by PCR in the blood and tick specimens, to date E. ewingii remains uncultivable and a stable laboratory isolate is not available. PCR tests based on the E. ewingii-specific partial sequence of a 16S rRNA gene and a partial p28-19 sequence have been reported (Gusa, A. A., et al. 2001. J Clin Microbiol 39:3871-3876). Yet, sensitivities and specificities of E. ewingii PCR tests in clinical specimens are unknown, as there are no other definitive tests with which to compare. The microscopic observation of morulae in Romanovsky dye-stained peripheral blood granulocytes provides definitive proof of ehrlichial infection. Unfortunately, this test cannot be used as a single diagnostic test for E. ewingii infection because it cannot distinguish E. ewingii morulae from other granulocytic agents, such as A. phagocytophilum. Furthermore, negative results from Romanovsky dye-staining cannot rule out E. ewingii infection, owing to high false-negative rates caused by sample conditions and the low sensitivity of the assay. These setbacks in prior diagnostic testing necessitate an additional test to properly identify E. ewingii infection.

SUMMARY

Provided herein is an isolated E. ewingii (EE) polypeptide that includes an amino acid sequence of a mature EE protein or a functional derivative thereof. The mature EE protein is selected from the group consisting of: (1) amino acid 24 to 293 of SEQ ID NO 3 corresponding to a mature OMP-1-1 protein encoded by nucleotide 67203-1484 of SEQ ID NO: 1; (2) amino acid-22 to 272 of SEQ ID NO: 4 corresponding to a mature OMP-1-2 protein encoded by nucleotide 2116-2871 of SEQ ID NO: 1; (3) amino acid 24 to 284 of SEQ ID NO: 6 corresponding to a mature OMP-1-3 protein encoded by nucleotide 3610-4395 of SEQ ID NO: 1; (4) amino acid 28 to 293 of SEQ ID NO: 7 corresponding to a mature OMP-1-4 protein encoded by nucleotide 4486-5286 of SEQ ID NO: 1; (5) amino acid 24 to 272 of SEQ ID NO: 8 corresponding to a mature OMP-1-5 protein encoded by nucleotide 5380-6129 of SEQ ID NO: 1; (6) amino acid 26 to 299 of SEQ ID NO: 9 corresponding to a mature OMP-1-6 protein encoded by nucleotide 6216-7040 of SEQ ID NO: 1; (7) amino acid 27 to 284 of SEQ ID NO: 10 corresponding to a mature OMP-1-7 protein encoded by nucleotide 7145-7921 of SEQ ID NO: 1; (8) amino acid 29 to 243 of SEQ ID NO: 11 corresponding to a mature OMP-1-8 protein encoded by nucleotide 8032-8679 of SEQ ID NO: 1; (9) amino acid 28 to 281 of SEQ ID NO: 12 corresponding to a mature OMP-1-9 protein encoded by nucleotide 8772-9536 of SEQ ID NO: 1; (10) amino acid 26 to 280 of SEQ ID NO: 13 corresponding to a mature OMP-1-10 protein encoded by nucleotide 9620-10387 of SEQ ID NO: 1; (11) amino acid 28 to 290 of SEQ ID NO: 14 corresponding to a mature OMP-1-11 protein encoded by nucleotide 10477-11268 of SEQ ID NO: 1; (12) amino acid 27 to 298 of SEQ ID NO: 15 corresponding to a mature OMP-1-12 protein encoded by nucleotide 11370-12188 of SEQ ID NO: 1; (13) amino acid 30 to 302 of SEQ ID NO: 16 corresponding to a mature OMP-1-13 protein encoded by nucleotide 12292-13113 of SEQ ID NO: 1; (14) amino acid 26 to 285 of SEQ ID NO: 17 corresponding to a mature OMP-1-14 protein encoded by nucleotide 14530-15312 of SEQ ID NO: 1; (15) amino acid 26 to 278 of SEQ ID NO: 18 corresponding to a mature OMP-1-15 protein encoded by nucleotide 15689-16450 of SEQ ID NO: 1; (16) amino acid 26 to 282 of SEQ ID NO: 19 corresponding to a mature OMP-1-16 protein encoded by nucleotide 16861-17634 of SEQ ID NO: 1; (17) amino acid 26 to 272 of SEQ ID NO: 20 corresponding to a mature OMP-1-17 protein encoded by nucleotide 18479-19222 of SEQ ID NO: 1; (18) amino acid 33 to 282 of SEQ ID NO: 21 corresponding to a mature OMP-1-18 protein encoded by nucleotide 19558-20310 of SEQ ID NO: 1; or (19) amino acid 24 to 282 of SEQ ID NO: 22 corresponding to a mature OMP-1-19 protein encoded by nucleotide 21188-21967 of SEQ ID NO: 1. Excluded from the isolated polypeptide sequence are the sequence SEQ ID NO: 128, 130, 132, 134, 136; or any fragment thereof. Each EE polypeptide has a specific binding affinity for an anti-E. ewingii antibody.

In some embodiments, the functional derivative of the EE protein comprises a sequence which is at least 85%, 90%, 95%, or 98% identical to one of the sequences (1)-(19), described above.

In some embodiments, the functional derivative comprises an immunoreactive fragment that has a length of from 6 amino acids to less than the full length of the EE protein and comprises at least 6 consecutive amino acids of one or more sequences selected from the group consisting of: (1) SEQ ID NO: 137-155; (2) SEQ ID NO: 156-173; (3) SEQ ID NO: 174-191; (4) SEQ ID NO: 192-208; (5) SEQ ID NO: 209-227; and (6) any combination of the sequences (1)-(5). Each immunoreactive fragment has a specific binding affinity for an anti-E. ewingii antibody.

In some embodiments, the EE polypeptide comprises a sequence selected from the group consisting of: (1) SEQ ID NO: 3 corresponding to an immature OMP-1-1 protein; (2) SEQ ID NO: 4 corresponding to an immature OMP-1-2 protein; (3) SEQ ID NO: 6 corresponding to an immature OMP-1-3 protein; (4) SEQ ID NO: 7 corresponding to an immature OMP-1-4 protein; (5) SEQ ID NO: 8 corresponding to an immature OMP-1-5 protein; (6) SEQ ID NO: 9 corresponding to an immature OMP-1-6 protein; (7) SEQ ID NO: 10 corresponding to an immature OMP-1-7 protein; (8) SEQ ID NO: 11 corresponding to an immature OMP-1-8 protein; (9) SEQ ID NO: 12 corresponding to an immature OMP-1-9 protein; (10) SEQ ID NO: 13 corresponding to an immature OMP-1-10 protein; (11) SEQ ID NO: 14 corresponding to an immature OMP-1-11 protein; (12) SEQ ID NO: 15 corresponding to an immature OMP-1-12 protein; (13) SEQ ID NO: 16 corresponding to an immature OMP-1-13 protein; (14) SEQ ID NO: 17 corresponding to an immature OMP-1-14 protein; (15) SEQ ID NO: 18 corresponding to an immature OMP-1-15 protein; (16) SEQ ID NO: 19 corresponding to an immature OMP-1-16; (17) SEQ ID NO: 20 corresponding to an immature OMP-1-17 protein; (18) SEQ ID NO: 21 corresponding to an immature OMP-1-18 protein; and (19) SEQ ID NO: 22 corresponding to an immature OMP-1-19 protein.

Also provided herein is an isolated polynucleotide encoding an E. ewingii (EE) protein, or a functional derivative thereof. The EE protein is selected from the group consisting of: (1) a mature OMP-1-1 protein encoded by nucleotide 672-1484 of SEQ ID NO: 1; (2) a mature OMP-1-2 protein encoded by nucleotide 2116-2871 of SEQ ID NO: 1; (3) a mature OMP-1-3 protein encoded by nucleotide 3610-4395 of SEQ ID NO: 1; (4) a mature OMP-1-4 protein encoded by nucleotide 4486-5286 of SEQ ID NO: 1; (5) a mature OMP-1-5 protein encoded by nucleotide 5380-6129 of SEQ ID NO: 1; (6) a mature OMP-1-6 protein encoded by nucleotide 6216-7040 of SEQ ID NO: 1; (7) a mature OMP-1-7 protein encoded by nucleotide 7145-7921 of SEQ ID NO: 1; (8) a mature OMP-1-8 protein encoded by nucleotide 8032-8679 of SEQ ID NO: 1; (9) a mature OMP-1-9 protein encoded by nucleotide 8772-9536 of SEQ ID NO: 1; (10) a mature OMP-1-10 protein encoded by nucleotide 9620-10387 of SEQ ID NO: 1; (11) a mature OMP-1-11 protein encoded by nucleotide 10477-11268 of SEQ ID NO: 1; (12) a mature OMP-1-12 protein encoded by nucleotide 11370-12188 of SEQ ID NO: 1; (13) a mature OMP-1-13 protein encoded by nucleotide 12292-13113 of SEQ ID NO: 1; (14) a mature OMP-1-14 protein encoded by nucleotide 14530-15312 of SEQ ID NO: 1; (15) a mature OMP-1-15 protein encoded by nucleotide 15689-16450 of SEQ ID NO: 1; (16) a mature OMP-1-16 protein encoded by nucleotide 16861-17634 of SEQ ID NO: 1; (17) a mature OMP-1-17 protein encoded by nucleotide 18479-19222 of SEQ ID NO: 1; (18) a mature OMP-1-18 protein encoded by nucleotide 19558-20310 of SEQ ID NO: 1; and (19) a mature OMP-1-19 protein encoded by nucleotide 21188-21967 of SEQ ID NO: 1. The functional derivative should not have the sequence SEQ ID NO: 128, 130, 132, 134, 136; or any fragment thereof, and each functional derivative should have a specific binding affinity for an anti-E. ewingii antibody.

In some embodiments, the functional derivative encoded by the polynucleotide comprises a sequence which is at least 85%, 90%, 95% or 98% identical to the sequence (1)-(19), as described above.

In some embodiments, the functional derivative encoded by the polynucleotide comprises an immunoreactive fragment that has a length of from 6 amino acids to less than the full length of the EE protein and comprises 6 or more consecutive amino acids from the following sequences: (1) SEQ ID NO: 137-155; (2) SEQ ID NO: 156-173; (3) SEQ ID NO: 174-191; (4) SEQ ID NO: 192-208; (5) SEQ ID NO: 209-227; or (6) any combination of the sequences (1)-(5); wherein each immunoreactive fragment has a specific binding affinity for an anti-E. ewingii antibody.

In other embodiments, the EE protein encoded by the polynucleotide comprises a sequence selected from the group consisting of: (1) SEQ ID NO: 3 corresponding to an immature OMP-1-1 protein; (2) SEQ ID NO: 4 corresponding to an immature OMP-1-2 protein; (3) SEQ ID NO: 6 corresponding to an immature OMP-1-3 protein; (4) SEQ ID NO: 7 corresponding to an immature OMP-1-4 protein; (5) SEQ ID NO: 8 corresponding to an immature OMP-1-5 protein; (6) SEQ ID NO: 9 corresponding to an immature OMP-1-6 protein; (7) SEQ ID NO: 10 corresponding to an immature OMP-1-7 protein; (8) SEQ ID NO: 11 corresponding to an immature OMP-1-8 protein; (9) SEQ ID NO: 12 corresponding to an immature OMP-1-9 protein; (10) SEQ ID NO: 13 corresponding to an immature OMP-1-10 protein; (11) SEQ ID NO: 14 corresponding to an immature OMP-1-11 protein; (12) SEQ ID NO: 15 corresponding to an immature OMP-1-12 protein; (13) SEQ ID NO: 16 corresponding to an immature OMP-1-13 protein; (14) SEQ ID NO: 17 corresponding to an immature OMP-1-14 protein; (15) SEQ ID NO:18 corresponding to an immature OMP-1-15 protein; (16) SEQ ID NO: 19 corresponding to an immature OMP-1-16; (17) SEQ ID NO: 20 corresponding to an immature OMP-1-17 protein; (18) SEQ ID NO: 21 corresponding to an immature OMP-1-18 protein; (19) SEQ ID NO: 22 corresponding to an immature OMP-1-19 protein.

In one embodiment, the polynucleotide comprises a portion of the nucleotide sequence SEQ ID NO: 1.

The invention also relates to a kit for detecting antibodies specific for E. ewingii (EE), the kit comprising an isolated EE polypeptide as described herein.

Also provided herein is a method for detecting antibodies specific for E. ewingii (EE). The method includes: (a) contacting a test sample with one or more isolated EE polypeptides, as described herein, under conditions that allow polypeptide/antibody complexes to form; and (b) assaying for the formation of a complex between antibodies in the test sample and the one or more EE polypeptides; wherein the formation of the complex is an indication that antibodies specific for E. ewingii are present in the test sample.

Also provided are isolated antibodies having specific binding affinity to an isolated EE polypeptide, as described herein.

Also provided is an immunogenic composition comprising one or more isolated E. ewingii OMP proteins, or immunogenic fragments or variants thereof, or a fusion protein containing same, and a pharmaceutically acceptable carrier. Such a composition is capable of producing antibodies specific to E. ewingii in a subject to whom the immunogenic composition has been administered. The isolated E. ewingii OMP protein for use in such a composition is selected from the group consisting of: (1) amino acid 24 to 293 of SEQ ID NO 3 corresponding to a mature OMP-1-1 protein; (2) amino acid 22 to 272 of SEQ ID NO: 4 corresponding to a mature OMP-1-2 protein; (3) amino acid 24 to 284 of SEQ ID NO: 6 corresponding to a mature OMP-1-3 protein; (4) amino acid 28 to 293 of SEQ ID NO: 7 corresponding to a mature OMP-1-4 protein; (5) amino acid 24 to 272 of SEQ ID NO: 8 corresponding to a mature OMP-1-5 protein; (6) amino acid 26 to 299 of SEQ ID NO: 9 corresponding to a mature OMP-1-6 protein; (7) amino acid 27 to 284 of SEQ ID NO: 10 corresponding to a mature OMP-1-7 protein; (8) amino acid 29 to 243 of SEQ ID NO: 11 corresponding to a mature OMP-1-8 protein; (9) amino acid 28 to 281 of SEQ ID NO: 12 corresponding to a mature OMP-1-9 protein; (10) amino acid 26 to 280 of SEQ ID NO: 13 corresponding to a mature OMP-1-10 protein; (11) amino acid 28 to 290 of SEQ ID NO: 14 corresponding to a mature OMP-1-11 protein; (12) amino acid 27 to 298 of SEQ ID NO: 15 corresponding to a mature OMP-1-12 protein; (13) amino acid 30 to 302 of SEQ ID NO: 16 corresponding to a mature OMP-1-13 protein; (14) amino acid 26 to 285 of SEQ ID NO: 17 corresponding to a mature OMP-1-14 protein; (15) amino acid 26 to 278 of SEQ ID NO: 18 corresponding to a mature OMP-1-15 protein; (16) amino acid 26 to 282 of SEQ ID NO: 19 corresponding to a mature OMP-1-16 protein; (17) amino acid 26 to 272 of SEQ ID NO: 20 corresponding to a mature OMP-1-17 protein; (18) amino acid 33 to 282 of SEQ ID NO: 21 corresponding to a mature OMP-1-18 protein; and (19) amino acid 24 to 282 of SEQ ID NO: 22 corresponding to a mature OMP-1-19 protein.

In one embodiment, the fusion protein in such a composition comprises an isolated E. ewingii OMP protein, or immunogenic fragment or variant thereof, and an N-terminal or C-terminal peptide or tag.

BRIEF DESCRIPTION OF FIGURES

FIG. 1. Strategy of E. ewingii omp-1 cluster sequencing. E. chaffeensis omp-1 and E. canis p30 were aligned to design 21 pairs of degenerate primers. The OMP-1 multigene locus was divided into three fragments each composed of seven shorter fragments (A). The initial nested touch-down PCRs generated four specific sequences within fragments 1 and 2 (B). Two fragments were amplified by nested touchdown PCR within fragment 3 using the p28-19 sequence and degenerate primers. Specific primers were designed to close all gaps (C). Two poly A/T and G/C regions were cloned into a TA vector and sequenced. The final sequence (24,126 bp) was assembled using SeqMan program in the DNASTAR software.

FIG. 2. Schematic representation of the organization of the E. ewingii omp-1 gene cluster. Genes are represented as boxes with arrows indicating their orientation. omp-1s are shown in a horizontal shading pattern. Black, white, and gray boxes show tr1, unknown genes, and secA, respectively.

FIG. 3. Synteny analysis of the E. ewingii (Ee) omp-1 cluster relative to E. chaffeensis (Ech), E. canis (Eca), and E. ruminantium (Eru) by the Artemis comparison tool.

FIG. 4. Dot plot analysis of the E. ewingii omp-1 cluster (A), and the E. ewingii omp-1 cluster relative to E. chaffeensis (B), E. canis (C), or E. ruminantium (D). Repetitive regions consisting of multiple homologous DNA segments were analyzed using the web-based dot plot program (JDotter) (available at athena.bioc.uvic.ca/index.php. The window cutoff was set to the default. The α, β and γ repetitive regions described by Ohashi et al., 2001, Infect Immun 69:2083-2091, are marked by lines.

FIG. 5. Phylogram of OMP proteins of E. ewingii, E. chaffeensis, E. canis, and E. ruminantium. A total of 39 OMPs were segregated into 10 clusters with four or three Ehrlichia species each, but 40 remaining proteins were not. The tree was constructed using the Neighbor-Joining (NEIGHBOR program from PHYLIP) method based on the alignment generated with CLUSTAL V; 1000 bootstrap replications were performed. The nodes supported by bootstrap values greater than 60% are labeled. The OMPs encoded by the three repetitive regions in FIG. 4 are indicated by α, β1, and β2. Branch lengths are proportional to percent divergence. The calibration bar is on the lower left.

FIG. 6. ELISA analysis of E. ewingii- and E. chaffeensis-infected dogs with the 19 EEOMP-1 oligopeptides. Preifection control and post-infection plasma from dogs were allowed to react with the 19 synthesized EEOMP-1 specific peptides. The y-axis shows OD_(405nm)-OD_(492nm). A reaction was considered to be positive when the plasma from infected dogs yielded an OD_(405nm)-OD_(492nm) value greater than the mean OD_(405nm)-OD_(492nm) of preinfection control plasma+three standard deviations (dashed line with closed triangles). Representative data of 3-5 assays is shown. Reactivity ratios of E. ewingii/control plasma (EW/Control) and E. chaffeensis/control plasma (ECH/Control) were calculated based on the averages of three EW-positive and three ECH-positive samples, respectively, to four negative control samples. EEOMP-1 peptides that showed good sensitivity and specificity for detecting E. ewingii infection are underlined.

FIG. 7. Alignment of the 19 E. ewingii OMP-1 proteins SEQ ID NOS 228-246, respectively, in order of appearance).

FIG. 8. Alignment of E. ewingii OMP-1, E. canis P30. E. chaffeensis OMP-1, and E. ruminantium MAP1 proteins (SEQ ID NOS 247-323, respectively, in order of appearance).

DETAILED DESCRIPTION

The present invention will now be described with occasional reference to some specific embodiments disclosed herein. This invention may, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope to those skilled in the art.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description herein is for describing particular embodiments only and is not intended to be limiting. As used in the description and the appended claims, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.

Unless otherwise indicated, all numbers expressing quantities of ingredients, properties such as molecular weight, reaction conditions, and so forth as used in the specification and claims are to be understood as being modified in all instances by the term “about.” Accordingly, unless otherwise indicated, the numerical properties set forth in the following specification and claims are approximations that may vary depending on the desired properties sought to be obtained in embodiments of the present invention. Notwithstanding that the numerical ranges and parameters setting forth the broad scope are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. Any numerical values, however, inherently contain certain errors necessarily resulting from error found in their respective measurements.

DEFINITIONS

A “polynucleotide” or “nucleic acid molecule,” as is generally understood and used herein, refers to a polymer of nucleotides, and includes but should not be limited to DNA and RNA.

“Recombinant DNA” is any DNA molecule formed by joining DNA segments from different sources and produced using “recombinant DNA” technology (also known as “molecular genetic engineering”).

A “DNA segment or fragment,” as is generally understood and used herein, refers to a molecule comprising a linear stretch of nucleotides wherein the nucleotides are present in a sequence that can encode, through the genetic code, a molecule comprising a linear sequence of amino acid residues that is referred to as a protein, a protein fragment or a polypeptide.

“Gene” refers to a DNA sequence related to a single polypeptide chain or protein, and as used herein includes the 5′ and 3′ untranslated ends. The polypeptide can be encoded by a full-length sequence or any portion of the coding sequence, so long as the functional activity (i.e. immuno-reactivity or immunogenicity) of the protein is retained.

“Complementary,” as used herein, refers to the natural binding of polynucleotides under permissive salt and temperature conditions by base pairing.

“Complementary DNA” or “cDNA” refers to recombinant nucleic acid molecules synthesized by reverse transcription of messenger RNA (“mRNA”).

Open Reading Frame (“ORF”). A series of codons (base triplets) which can be translated into a protein without any termination codons interrupting the relevant reading frames. An ORF can be evidence that a DNA sequence is part of a gene.

Restriction Endonuclease. A “restriction endonuclease” (also “restriction enzyme”) is an enzyme that has the capacity to recognize a specific base sequence (usually 4, 5, or 6 base pairs in length) in a DNA molecule, and to cleave the DNA molecule at every place where this sequence appears. For example, EcoRI recognizes the base sequence GAATTC/CTTAAG.

Restriction Fragment. The DNA molecules produced by digestion with a restriction endonuclease are referred to as “restriction fragments.” Any given genome can be digested by a particular restriction endonuclease into a discrete set of restriction fragments.

Agarose Gel Electrophoresis. To determine the length of restriction fragments, an analytical method for fractionating double-stranded DNA molecules on the basis of size is required. The most commonly used technique (though not the only one) for achieving such a fractionation is “agarose gel electrophoresis.” The principle of this method is that DNA molecules migrate through the gel as though it were a sieve that retards the movement of the largest molecules to the greatest extent and the movement of the smallest molecules to the least extent. Note that the smaller the DNA fragment, the greater the mobility under electrophoresis in the agarose gel.

The DNA fragments fractionated by “agarose gel electrophoresis” can be visualized directly by a staining procedure if the number of fragments included in the pattern is small. The DNA fragments of genomes can be visualized successfully. However, most genomes, including the human genome, contain far too many DNA sequences to produce a simple pattern of restriction fragments. For example, the human genome is digested into approximately 1,000,000 different DNA fragments by EcoRI. In order to visualize a small subset of these fragments, a methodology referred to as the Southern hybridization procedure can be applied.

Southern Transfer Procedure. The purpose of the “Southern transfer procedure” (also “Southern blotting”) is to physically transfer DNA fractionated by agarose gel electrophoresis onto a nitrocellulose filter paper or another appropriate surface or method, while retaining the relative positions of DNA fragments resulting from the fractionation procedure. The methodology used to accomplish the transfer from agarose gel to nitrocellulose involves drawing the DNA from the gel into the nitrocellulose paper by capillary action or electrophoretic transfer.

Nucleic Acid Hybridization. “Nucleic acid hybridization” depends on the principle that two single-stranded nucleic acid molecules that have complementary base sequences will reform the thermodynamically favored double-stranded structure if they are mixed under the proper conditions. The double-stranded structure will be formed between two complementary single-stranded nucleic acids even if one is immobilized on a nitrocellulose filter. In the Southern hybridization procedure, the latter situation occurs. As noted previously, the DNA of the test sample to be tested is digested with a restriction endonuclease, fractionated by agarose gel electrophoresis, converted to the single-stranded form, and transferred to nitrocellulose paper, making it available for reannealing to the hybridization probe. Examples of hybridization conditions can be found in Ausubel, F. M. et al., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., New York, N.Y. (1989). For example, a nitrocellulose filter is incubated overnight at 68° C. with labeled probe in a solution containing 50% formamide, high salt (either 5×SSC[20×: 3M NaCl/0.3M trisodium citrate] or 5×SSPE [20×: 3.6M NaCl/0.2M NaH₂PO₄/0.02M EDTA, pH 7.7]), 5×Denhardt's solution, 1% SDS, and 100 μg/ml denatured salmon sperm DNA. This is followed by several washes in 0.2×SSC/0.1% SDS at a temperature selected based on the desired stringency: room temperature (low stringency), 42° C. (moderate stringency) or 68° C. (high stringency). The temperature selected is determined based on the melting temperature (Tm) of the DNA hybrid.

Hybridization Probe. To visualize a particular DNA sequence in the Southern hybridization procedure, a labeled DNA molecule or “hybridization probe” is reacted to the fractionated DNA bound to the nitrocellulose filter. The areas on the filter that carry DNA sequences complementary to the labeled DNA probe become labeled themselves as a consequence of the reannealing reaction. The areas of the filter that exhibit such labeling are visualized. The hybridization probe is generally produced by molecular cloning of a specific DNA sequence.

A “purified” or “isolated” polypeptide or nucleic acid is a polypeptide or nucleic acid that has been separated from a cellular component. Purified or isolated polypeptides or nucleic acids have been purified to a level of purity not found in nature.

A “mutation” is any detectable change in the genetic material which can be transmitted to daughter cells and possibly even to succeeding generations giving rise to mutant cells or mutant individuals. If the descendants of a mutant cell give rise only to somatic cells in multicellular organisms, a mutant spot or area of cells arises. “Mutations” in the germ line of sexually reproducing organisms can be transmitted by the gametes to the next generation resulting in an individual with the new mutant condition in both its somatic and germ cells. A “mutation” can be any (or a combination of) detectable, unnatural change affecting the chemical or physical constitution, mutability, replication, phenotypic function, or recombination of one or more deoxyribonucleotides; nucleotides can be added, deleted, substituted for, inverted, or transposed to new positions with and without inversion. “Mutations” can occur spontaneously and can be induced experimentally by application of mutagens.

Oligonucleotide or Oligomer. A molecule comprised of two or more deoxyribonucleotides or ribonucleotides, preferably more than three. Its exact size will depend on many factors, which in turn depend on the ultimate function or use of the “oligonucleotide.” An “oligonucleotide” can be derived synthetically or by cloning.

Sequence Amplification. A method for generating large amounts of a target sequence. In general, one or more amplification primers are annealed to a nucleic acid sequence. Using appropriate enzymes, sequences found adjacent to, or in between the primers are amplified.

Amplification Primer or “Primer”. An oligonucleotide which is capable of annealing adjacent to a target sequence and serving as an initiation point for DNA synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is initiated.

Vector. A plasmid or phage DNA or other DNA sequence into which DNA can be inserted to be cloned. The “vector” can replicate autonomously in a host cell, and can be further characterized by one or a small number of endonuclease recognition sites at which such DNA sequences can be cut in a determinable fashion and into which DNA can be inserted. The “vector” can further contain a marker suitable for use in the identification of cells transformed with the “vector”. Markers, for example, are tetracycline resistance or ampicillin resistance. The words “cloning vehicle” are sometimes used for “vector.”

Expression. “Expression” is the process by which a structural gene produces a polypeptide. It involves transcription of the gene into mRNA, and the translation of such mRNA into polypeptide(s).

Expression Vector. A vector or vehicle similar to a cloning vector but which is capable of expressing a gene which has been cloned into it, after transformation into a host. The cloned gene is usually placed under the control of (i.e., operably linked to) certain control sequences such as promoter sequences.

Expression control sequences will vary depending on whether the vector is designed to express the operably linked gene in a prokaryotic or eukaryotic host and can additionally contain transcriptional elements such as enhancer elements, termination sequences, tissue-specificity elements, and/or translational initiation and termination sites.

EMBODIMENTS

The omp-1 gene cluster sequence of E. ewingii (SEQ ID NO: 1) contains 23 open reading frames (ORFs), as outlines in Table 1. The ORFs encode E. ewingii (EE) outer membrane proteins (OMPs) OMP-1-1, OMP-1-2, OMP-1-3, OMP-1-4, OMP-1-5, OMP-1-6, OMP-1-7, OMP-1-8, OMP-1-9, OMP-1-10, OMP-1-11, OMP-1-12, OMP-1-13, OMP-1-14, OMP-1-15, OMP-1-16, OMP-1-17, OMP-1-18, OMP-1-19, as well as the two proteins UN2 and UN3. As used herein, the term “EE proteins” and “EE OMPs” are used interchangeably and refer to the above mentioned 19 EE OMPs of E. ewingii in their mature (i.e. lacking the signal peptide) as well as immature (i.e. including the signal peptide) forms, as listed in Table 1.

“EE polypeptides” include EE proteins and functional derivatives of the EE proteins, as well as fusion proteins made from such proteins or their functional derivatives. The invention also relates to isolated polynucleotides encoding EE polypeptides, probes, primers, antibodies and methods of their production, compositions containing one or more of the above mentioned molecules, as well as methods of using the probes, primers, EE polypeptides and antibodies for the purpose of diagnosis, screening, therapy and production of vaccines against E. ewingii.

Isolated E. ewingii Polypeptides

The 24-kb omp-1 gene locus contains 23 open reading frames (ORFs), numbered ORF 1 to 23 (see Table 1). These 23 ORFs are arranged in tandem except for three ORFs (ORF 19, 20, and 21) that are in the opposite orientation (i.e. the complementary strand encodes the OMP-1-17, OMP-1-18 and OMP-1-19 proteins, respectively). Nineteen of these 23 ORFs encode outer membrane proteins (EE proteins) enumerated as E. ewingii (EE)OMP-1-1 to EEOMP-1-19. Two ORFs encode the proteins UN2 and UN3. The mature OMP-1 proteins of E. ewingii have a molecular mass of about 25.1 to 31.3 kDa; and isoelectric points of 5.03 to 9.80. The properties of the polypeptides encoded by the ORFs of the E. ewingii omp-1 gene cluster, including signal peptide lengths, molecular mass, and isoelectric points of mature proteins, as well as the sequence identifiers for each protein, are shown in Table 1.

TABLE 1 Properties of E. ewingii proteins Length (bp) Upstream (based on the signal SEQ intergenic omp-1 nucleotide peptide Molecular OMP-1 ORF ID space sequence SEQ ID AA AA Mass^(a) number number NO (bp) NO: 1) number number (Da) PI^(a) NA ORF-1 2 NA 357 (3-359) 118 NA 13687.9^(b) 7.93^(b) Hypothetical transcriptional regulator OMP-1-1 ORF-2 3 244 882 (603-1484) 293 23 29848.6 6.38 OMP-1-2 ORF-3 4 568 819 (2053-2871) 272 21 28125.9 8.61 NA ORF-4 5 101 558 (2973-3530) 185 32 18000.7 9.80 UN2 OMP-1-3 ORF-5 6 10 855 (3541-4395) 284 23 29372.6 7.07 OMP-1-4 ORF-6 7 6 882 (4405-5286) 293 27 30194.6 9.47 OMP-1-5 ORF-7 8 24 819 (5311-6129) 272 23 28174.6 5.39 OMP-1-6 ORF-8 9 11 900 (6141-7040) 299 25 30194.6 9.47 OMP-1-7 ORF-9 10 26 855 (7067-7921) 284 26 28864.7 5.43 OMP-1-8 ORF-10 11 26 732 (7948-8679) 243 28 25098.4 7.88 OMP-1-9 ORF-11 12 11 846 (8691-9536) 281 27 28661.4 7.24 OMP-1- ORF-12 13 8 843 (9545-10387) 280 25 28476.4 6.25 10 OMP-1- ORF-13 14 8 867 (10396-11268) 290 27 29488.9 5.90 11 OMP-1- ORF-14 15 23 897 (11292-12188) 298 26 30036.8 5.89 12 OMP-1- ORF-15 16 16 909 (12205-13113) 302 29 31275.1 5.31 13 OMP-1- ORF-16 17 1343 858 (14455-15312) 285 25 28864.7 5.03 14 OMP-1- ORF-17 18 301 837 (15614-16450) 278 25 28862.8 5.58 15 OMP-1- ORF-18 19 335 849 (16786-17634) 282 25 28476.0 5.41 16 OMP-1- ORF-19 20 769 819 (18404-19222) 272 25 27782.3 6.50 17 OMP-1- ORF-20 21 539 849 (19462-20310) 282 32 28603.4 5.08 18 OMP-1- ORF-21 22 808 849 (21119-21967) 282 23 30505.0 7.88 19 NA ORF-22 23 1082 408 (23050-23457) 135 NA 14800.0 3.85 UN3 NA ORF-23 24 390 285 (23848-24126) 93 NA 10533.3^(b) 9.40^(b) SecA

In one aspect, the invention relates to isolated polypeptides comprising an amino acid sequence corresponding to EE proteins, or functional derivatives thereof. The isolated polypeptides of the invention expressly exclude a peptide that is the 505-bp E. ewingii p28-1 peptide deposited in GenBank accession numbers: AF287961 (SEQ ID NOS 127-128), AF287962 (SEQ ID NOS 129-130), AF287963 (SEQ ID NOS 131-132), AF287964 (SEQ ID NOS 133-134, AF287966 (SEQ ID NOS 135-136); or any fragment thereof.

The EE proteins include the immature (i.e. with the signal peptide) as well as the mature form (lacking the signal peptide) proteins of E. Ewingii, as described below and in Table 1.

In one embodiment, the polypeptide comprises the sequence of an immature OMP-1-1 protein having the amino acid sequence SEQ ID NO: 3, encoded by nucleotide 603-1484 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-1 protein having the amino acid sequence from residue 24 to 293 of SEQ ID NO: 3, encoded by nucleotide 672-1484 of SEQ ID NO: 1.

In one embodiment, the polypeptide comprises the sequence of an immature OMP-1-2 protein having the amino acid sequence SEQ ID NO: 4, encoded by nucleotide 2053-2871 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-2 protein having the amino acid sequence from residue 22 to 272 of SEQ ID NO: 4, encoded by nucleotide 2116-2871 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-3 protein having the amino acid sequence SEQ ID NO: 6, encoded by nucleotide 3541-4395 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-3 protein having the amino acid sequence from residue 24-284 of SEQ ID NO: 6, encoded by nucleotide 3610-4395 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-4 protein having the amino acid sequence SEQ ID NO: 7, encoded by nucleotide 4405-5286 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-2 protein having the amino acid sequence from residue 28 to 293 of SEQ ID NO: 7, encoded by nucleotide 4486-5286 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-5 protein having the amino acid sequence SEQ ID NO: 8, encoded by nucleotide 5311-6129 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-5 protein having the amino acid sequence from residue 24 to 272 of SEQ ID NO: 8, encoded by nucleotide 5380-6129 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-6 protein having the amino acid sequence SEQ ID NO: 9, encoded by nucleotide 6141-7040 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-6 protein having the amino acid sequence from residue 26 to 299 of SEQ ID NO: 9, encoded by nucleotide 6216-7040 of SEQ ID NO:1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-7 protein having the amino acid sequence SEQ ID NO: 10, encoded by nucleotide 7067-7921 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-7 protein having the amino acid sequence from residue 27 to 284 of SEQ ID NO: 10, encoded by nucleotide 7145-7921 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-8 protein having the amino acid sequence SEQ ID NO: 1, encoded by nucleotide 7948-8679 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-8 protein having the amino acid sequence from residue 29 to 243 of SEQ ID NO: 11, encoded by nucleotide 8032-8679 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-9 protein having the amino acid sequence SEQ ID NO: 12, encoded by nucleotide 8691-9536 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-9 protein having the amino acid sequence from residue 28 to 281 of SEQ ID NO: 12, encoded by nucleotide 8772-9536 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-10 protein having the amino acid sequence SEQ ID NO: 13, encoded by nucleotide 9545-10387 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-10 protein having the amino acid sequence from residue 26 to 280 of SEQ ID NO: 13, encoded by nucleotide 9620-10387 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-11 protein having the amino acid sequence SEQ ID NO: 14, encoded by nucleotide 10396-11268 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-111 protein having the amino acid sequence from residue 28 to 290 of SEQ ID NO: 14, encoded by encoded by nucleotide 10477-11268 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-12 protein having the amino acid sequence SEQ ID NO: 15, encoded by nucleotide 11292-12188 of SEQ ID NO: 1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-12 protein having the amino acid sequence from residue 27 to 298 of SEQ ID NO: 15, encoded by nucleotide 11370-12188 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-13 protein having the amino acid sequence SEQ ID NO: 16, encoded by nucleotide 12205-13113 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-13 protein having the amino acid sequence from residue 30 to 302 of SEQ ID NO: 16, encoded by nucleotide 12292-13113 of SEQ ID NO:1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-14 protein having the amino acid sequence SEQ ID NO: 17, encoded by nucleotide 14455-15312 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-14 protein having the amino acid sequence from residue 26 to 285 of SEQ ID NO: 17, encoded by nucleotide 14530-15312 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-15 protein having the amino acid sequence SEQ ID NO: 18, encoded by nucleotide 15614-16450 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-15 protein having the amino acid sequence from residue 26 to 278 of SEQ ID NO: 18, encoded by nucleotide 15689-16450 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-16 protein having the amino acid sequence SEQ ID NO: 19, encoded by nucleotide 16786-17634 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-16 protein having the amino acid sequence from residue 26 to 282 of SEQ ID NO: 19, encoded by nucleotide 16861-17634 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-17 protein having the amino acid sequence SEQ ID NO: 20, encoded by nucleotide 18404-19222 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-17 protein having the amino acid sequence from residue 26 to 272 of SEQ ID NO: 20, encoded by nucleotide 18479-19222 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-18 protein having the amino acid sequence SEQ ID NO: 21, encoded by nucleotide 19462-20310 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-18 protein having the amino acid sequence from residue 33 to 282 of SEQ ID NO: 21, encoded by nucleotide 19558-20310 of SEQ ID NO: 1.

In another embodiment, the polypeptide comprises the sequence of an immature OMP-1-19 protein having the amino acid sequence SEQ ID NO: 22, encoded by nucleotide 21119-21967 of SEQ ID NO:1. In another embodiment, the polypeptide comprises the sequence of a mature OMP-1-19 protein having the amino acid sequence from residue 24 to 282 of SEQ ID NO: 22, encoded by nucleotide 21188-21967 of SEQ ID NO: 1.

Each EE OMP-1 protein has several conserved regions, with amino acid sequences that are more or less conserved between the nineteen EE OMP-1 proteins. Each EE OMP-1 protein also includes 4 variable loops, referred to hereinafter as loops 1-4, that have variable amino sequences between the nineteen EE OMP-1 proteins (FIG. 7). Tables 2-5 shows the amino acid sequence of each of the variable loops in the EE OMP-1 proteins.

TABLE 2 Sequences of variable region loop 1 in EE OMP-1-1 to OMP-1-19 SEQ From aa ID residue # to OMP# Loop 1 NO: aa residue # OMP1-1 SPIPIDFSNESEMV 137 24 to 37 OMP1-2 QGLNDNIFKN 138 22 to 31 OMP1-3 LLALENNLSGGVGHI 139 21-35 OMP1-4 SFVFSMQGRSNIT 140 23-35 OMP1-5 VLIDTTEKDYASNV 141 24-37 OMP1-6 AISIDNNIIDQNL 142 25-37 OMP1-7 PISNNSEDNIF 143 28-38 OMP1-8 LNNAEDHKD 144 31-39 OMP1-9 ESNHYDKSL 145 28-36 OMP1-10 EVITHNDNKHPGI 146 26-38 OMP1-11 AKNNYSYINPVL 147 27-38 OMP1-12 ETTITNQPSGL 148 27-37 OMP1-13 ETIVDDIDRQFRL 149 30-42 OMP1-14 ADPMNSNDVSINDSKE 150 25-40 OMP1-15 LVSFIPCI 151 15-22 OMP1-16 FICELPGV 152 15-22 OMP1-17 DVVVSEEKR 153 26-34 OMP1-18 FYASMSFGMSNTLANQ 154 19-39 VSPIS OMP1-19 LVSDASDSHTKSVSL 155 26-40

TABLE 3 Sequences of variable region loop 2 in EE OMP-1-1 to OMP-1-19 SEQ From aa ID residue # to OMP # Loop 2 NO: aa residue # OMP1-1 YKSTGNSEADKSEKELTLFTLKESTQ 156 59-93 APDFTKKET OMP1-2 SKDTIGIFALKKDASLPTDIKKNS 157 54-77 OMP1-3 MEEATIGAVIPKSLKQDAEDITLSIL 158 53-82 ALST OMP1-4 FDTKDPIGLIRSARSTEPSVLKINTH 159 60-85 OMP1-5 SKTKNSIALEKPIESNSNILKS 160 60-81 OMP1-6 KKVDLIALKNDVTHITEEILKDP 161 62-84 OMP1-7 FATQKLMRVKKDSKEGLPNILKSKD 162 63-87 OMP1-8 CIIRLITVKDSHFFSINTSSYNLCLE 163 54-85 KHKNDI OMP1-9 DINTKGLFKLGHGVTLVEEDIKNHLQ 164 58-83 OMP1-10 ATTVQLVGLNYTAAPIDDIKTSSK 165 61-84 OMP1-11 HYDTQLLAELKKEVGSVTNTVIQAYA 166 60-94 NYNVPSQAP OMP1-12 VATKHLIALKKSVDSINAEKATPHNQ 167 60-91 GLGKPD OMP1-13 VTTKYLTALKKDADPTEKTGSTPHEK 168 65-96 GLGKPD OMP1-14 PIEGAISPTKKVLGLNKGGSIANSHD 169 64-95 FSKIDP OMP1-15 DTIETIATFGLSKTYNRSSPIHSDFT 170 58-86 DSK OMP1-16 — — — OMP1-17 IPGLTKKIFALSYDATDITKETSFK 171 58-84 QA OMP1-18 QILHDVATERVVGLKHDLLESADKLV 172 60-96 DNLYNFDLSED OMP1-19 LTSGIIANKRVLGLKNDILINADEAI 173 61-90 KNLS

TABLE 4 Sequences of variable region loop 3 in EE OMP-1-1 to OMP-1-19 SEQ From aa ID residue # to OMP # Loop 3-1 NO aa residue # OMP1-1 DKQKHTHPDNH 174 136-146 OMP1-2 EGYKJTGVEQH 175 122-132 OMP1-3 VSAPSGYDDNIYAYSI 176 123-139 OMP1-4 IKRLVNYASRDGH 177 129-141 OMP1-5 ELNSSSLISSNNHYTQLYE 178 126-144 OMP1-6 ITDCSNCTIN 179 127-136 OMP1-7 KDPKDCSVKDAFRHL 180 130-144 OMP1-8 TEDKYLTSEQEVNDY 181 120-134 OMP1-9 DLKNCTIQ 182 126-133 OMP1-10 TDPGNYTIK 183 127-135 OMP1-11 KNSGHSSIDAHR 184 136-147 OMP1-12 TLNDAF 185 130-135 OMP1-13 TISNAF 186 135-140 OMP1-14 KYYGLFREGTPQEEEH 187 146-161 OMP1-15 SNGAHM 188 121-126 OMP1-16 — — — OMP1-17 QFYREGSNNYKF 189 123-134 OMP1-18 VQDTKSHIVDDNYR 190 121-134 OMP1-19 RDTKNHIIDNN 191 134-144 SEQ From aa ID residue # to OMP # Loop 3-2 NO: aa residue # OMP1-1 SCTEQEMKPAQQNGSSKDGN 192 151-170 OMP1-2 LDTNGNQPKTDK 193 139-150 OMP1-3 SIEVPQLRSLPYHYT 194 138-152 OMP1-4 IPRDTFFNNSIPYAFNA 195 146-162 OMP1-5 ANFQNFATSR 196 145-154 OMP1-6 KDNNQVQPKAHDSSSTDSNNSSNNTK 197 148-175 KSYFTF OMP1-7 LDTGLSMPKEKK 198 150-161 OMP1-8 VNDYNIISAI 199 131-140 OMP1-9 ICKENDKPTPKEKK 200 145-158 OMP1-10 MNSSSNNQPKDKQFT 201 146-160 OMP1-11 HSNNGNTQQNPFA 202 154-166 OMP1-12 IESDQNKFQPKNANSNSSNKIYHT 203 154-177 OMP1-13 SESSKEPQPKNPNSAGNNKIFHT 204 159-181 OMP1-14 — — — OMP1-15 KDNANIGTTPQDKK 205 143-156 OMP1-16 — — — OMP1-17 ETITSKKF 206 141-148 OMP1-18 HGPAKHIN 207 152-159 OMP1-19 SKQDNLNSD 208 151-159

TABLE 5 Sequences of variable region loop 4 in EE OMP-1-1 to OMP-1-19 SEQ From aa ID residue # to OMP # Loop 4 NO aa residue # OMP1-1 TVQYPVKLTSPPTHIDPVVYFHSD 209 258-281 OMP1-2 NYPTDNNTTKTTVSAI 210 241-256 OMP1-3 LLDYPSYYRSLTSLSDNDPNRILPF 211 238-262 OMP1-4 PLMLSPSTPRRRIPPQSSSEVQDATG 212 249-276 LL OMP1-5 YTQYVSGINSLQEI 213 234-247 OMP1-6 TYAYILKDS 214 266-274 OMP1-7 NHVVELDDF 215 251-259 OMP1-8 SKIHYAIILSNNKYLQNSLGDTKTN 216 208-234 TY OMP1-9 QNMFDSNE 217 249-256 OMP1-10 QHVVTLDT 218 248-255 OMP1-11 QYVNTTTSQAIN 219 254-265 OMP1-12 QHIAELNDA 220 265-273 OMP1-13 QHVAELNDD 221 269-277 OMP1-14 KTPVTLDTAPQT 222 252-263 OMP1-15 DITPLKPNGIENTTATHVLV 223 242-256 OMP1-16 DIATILPSGSSIKDNQY 224 250-262 OMP1-17 YERVEIAYHPSIEEA 225 229-245 OMP1-18 EYSNIPVQYPRNLFYA 226 242-257 OMP1-19 QYSSISVKYPKVLVFPSTRS 227 242-261

Also provided herein are functional derivatives of the EE proteins enumerated above. A “functional derivative” of an EE protein or peptide sequence is a molecule that possesses immunoreactivity to EE antibodies that is substantially similar to that of the corresponding EE protein or peptide, i.e. an “immunoreactive” functional derivative is a polypeptide that has a specific binding affinity for anti-E. ewingii antibodies. The term “specific binding affinity” refers to binding with an affinity for the EE antibodies that is substantially greater than the binding affinity for E. canis, and/or E. chaffeensis, and/or E. ruminantium.

The functional derivatives of an EE protein can be identified using any of a variety of routine assays for detecting peptide antigen-antibody complexes, the presence of which is an indicator of selective binding. Such assays include, without limitation, enzyme-linked immunosorbent assays (ELISA), radioimmunoassays, western blotting, enzyme immunoassays, fluorescence immunoassays, luminescent immunoassays and the like. Methods for detecting a complex between a peptide and an antibody, and thereby determining if the peptide is an “immunoreactive functional derivative” are well known to those skilled in the art and are described, for example, in Example 1, as well as in ANTIBODIES: A LABORATORY MANUAL (Edward Harlow & David Lane, eds., Cold Spring Harbor Laboratory Press, 2.sup.nd ed. 1998a); and USING ANTIBODIES: A LABORATORY MANUAL: PORTABLE PROTOCOL No. I (Edward Harlow & David Lane, Cold Spring Harbor Laboratory Press, 1998b), which are hereby incorporated by reference in their entirety.

In one embodiment, the “specific binding affinity” of a functional derivative is defined as an ELISA assay result, where the ratio of E. chaffeensis or E. canis plasma reactivity/control plasma reactivity is ˜1.00, and where E. ewingii plasma reactivity yields an OD_(405nm)-OD_(492nm) value greater than the mean OD_(405nm)-OD_(492nm) of preinfection control plasma+three standard deviations.

Thus, the terms “functional derivative” and “immunoreactive functional derivative” are used interchangeably and refer to peptides and proteins that can function in substantially the same manner as the EE proteins or peptides disclosed herein, and can be substituted for the E. ewingii proteins or peptides in any aspect of the present invention.

A “functional derivative” of a protein or peptide can contain post-translational modifications such as covalently linked carbohydrate, depending on the necessity of such modifications for the performance of a specific function. The term “functional derivative” is intended to include the immunoreactive “variants” and “fragments” of the EE proteins.

A “variant” of an EE protein refers to a molecule substantially similar in structure and immunoreactivity to the EE protein. Thus, provided that two molecules possess a common immunoactivity and can substitute for each other, they are considered “variants” as that term is used herein even if the composition or secondary, tertiary, or quaternary structure of one of the molecules is not identical to that found in the other, or if the amino acid or nucleotide sequence is not identical. Thus, in one embodiment, a variant refers to a protein whose amino acid sequence is similar to the amino acid sequences of a mature EE protein, hereinafter referred to as the reference amino acid sequence, but does not have 100% identity with the respective reference sequence. The variant protein has an altered sequence in which one or more of the amino acids in the reference sequence is deleted or substituted, or one or more amino acids are inserted into the sequence of the reference amino acid sequence. As a result of the alterations, the variant protein has an amino acid sequence which is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the reference sequence. For example, variant sequences which are at least 95% identical have no more than 5 alterations, i.e. any combination of deletions, insertions or substitutions, per 100 amino acids of the reference sequence. Percent identity is determined by comparing the amino acid sequence of the variant with the reference sequence using any available sequence alignment program. An example includes the MEGALIGN project in the DNA STAR program. Sequences are aligned for identity calculations using the method of the software basic local alignment search tool in the BLAST network service (the National Center for Biotechnology Information, Bethesda, Md.) which employs the method of Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990) J. Mol. Biol. 215, 403-410. Identities are calculated by the Align program (DNAstar, Inc.) In all cases, internal gaps and amino acid insertions in the candidate sequence as aligned are not ignored when making the identity calculation.

In some embodiments, the variant proteins include all or substantially all of the amino acid sequences of the variable loops, presented in Tables 2-5. In these embodiments, the amino acid changes are made in areas other than the variable loops of Tables 2-5. Thus, for example, one or more amino acids of the conservative regions located between the variable loops can be changed without significantly altering the immunoreactivity of the resultant variant protein.

Variants of the EE proteins can include nonconservative as well as conservative amino acid substitutions. A conservative substitution is one in which the substituted amino acid has similar structural or chemical properties with the corresponding amino acid in the reference sequence. By way of example, conservative amino acid substitutions involve substitution of one aliphatic or hydrophobic amino acids, e.g. alanine, valine, leucine and isoleucine, with another; substitution of one hydroxyl-containing amino acid, e.g. serine and threonine, with another; substitution of one acidic residue, e.g. glutamic acid or aspartic acid, with another; replacement of one amide-containing residue, e.g. asparagine and glutamine, with another; replacement of one aromatic residue, e.g. phenylalanine and tyrosine, with another; replacement of one basic residue, e.g. lysine, arginine and histidine, with another; and replacement of one small amino acid, e.g., alanine, serine, threonine, methionine, and glycine, with another.

The alterations are designed not to abolish the immunoreactivity of the variant EE protein with antibodies that bind to the reference protein. Guidance in determining which amino acid residues may be substituted, inserted or deleted without abolishing such immunoreactivity of the variant protein are found using computer programs well known in the art, for example, DNASTAR software.

Preparation of an EE protein variant in accordance herewith can be achieved by site-specific mutagenesis of DNA that encodes an earlier prepared variant or a nonvariant version of the protein. Site-specific mutagenesis allows the production of EE protein variants through the use of specific oligonucleotide sequences that encode the DNA sequence of the desired mutation. In general, the technique of site-specific mutagenesis is well known in the art, as exemplified by publications such as Adelman et al., DNA 2:183 (1983) and Ausubel et al. “Current Protocols in Molecular Biology”, J. Wiley & Sons, NY, N.Y., 1996. As will be appreciated, the site-specific mutagenesis technique can employ a phage vector that exists in both a single-stranded and double-stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage, for example, as disclosed by Messing et al., Third Cleveland Symposium on Macromolecules and Recombinant DNA, Editor A. Walton, Elsevier, Amsterdam (1981). These phage are readily commercially available and their use is generally well known to those skilled in the art. Alternatively, plasmid vectors that contain a single-stranded phage origin of replication (Vieira et al., Meth. Enzymol. 153:3 (1987)) can be employed to obtain single-stranded DNA.

In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector that includes within its sequence a DNA sequence that encodes the relevant protein. An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically, for example, by the method of Crea et al., Proc. Natl. Acad. Sci. (USA) 75:5765 (1978). This primer is then annealed with the single-stranded protein-sequence-containing vector, and subjected to DNA-polymerizing enzymes such as E. coli polymerase I Klenow fragment, to complete the synthesis of the mutation-bearing strand. Thus, a mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells and clones are selected that include recombinant vectors bearing the mutated sequence arrangement. After such a clone is selected, the mutated protein region can be removed and placed in an appropriate vector for protein production, generally an expression vector of the type that can be employed for transformation of an appropriate host.

Some deletions and insertions, and substitutions are not expected to produce radical changes in the characteristics of EE proteins. However, when it is difficult to predict the exact effect of the substitution, deletion, or insertion in advance of doing so, one skilled in the art will appreciate that the effect will be evaluated by routine screening assays. For example, a variant typically is made by site-specific mutagenesis of the native EE OMP-encoding nucleic acid, expression of the variant nucleic acid in recombinant cell culture, and, optionally, purification from the cell culture, for example, by immunoaffinity adsorption on a column (to absorb the variant by binding it to at least one remaining immune epitope). The activity of the cell lysate or purified EE OMP molecule variant is then screened in a suitable screening assay for the desired characteristic. For example, a change in the immunological character of the OMP molecule, such as affinity for a given antibody, is measured by a competitive type immunoassay. Changes in immunomodulation activity are measured by the appropriate assay. Modifications of such protein properties as redox or thermal stability, hydrophobicity, susceptibility to proteolytic degradation or the tendency to aggregate with carriers or into multimers are assayed by methods well known to the ordinarily skilled artisan.

A “fragment” is an immunoreactive fragment of an EE protein that has a length of from about 6 amino acids to less than the full length EE protein and includes a sequence that contains at least 6 consecutive amino acids of a sequence of the EE protein. These fragments are collectively referred to herein as “EE peptides.” In some embodiments, the fragment has at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, or 100 consecutive amino acids of an EE protein sequence. The fragment can have a length of at most, e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, or 300 amino acids. In some embodiments, an immunoreactive fragment has from six to sixty amino acids, from six to fifty amino acids, from ten to fifty amino acids, from six to twenty amino acids, from eight to twenty amino acids, from ten to twenty amino acids, from twelve to twenty amino acids or from twelve to seventeen amino acids. The immunoreactive fragments contemplated herein specifically exclude any fragment encoded by a 505 bp that is from nucleotide 16,918 to 17,422 of the omp-1 gene cluster sequence, SEQ ID NO: 1, previously reported as the E. ewingii p-28-1 peptide. Thus, the immunoreactive fragments as described herein specifically exclude any of the previously reported sequences for the 505-bp E. ewingii p28-1 sequence, GenBank accession numbers: AF287961 (SEQ ID NOS 127-128), AF287962 (SEQ ID NOS 129-130), AF287963 (SEQ ID NOS 131-132), AF287964 (SEQ ID NOS 133-134, AF287966 (SEQ ID NOS 135-136); or any fragment thereof.

In some embodiments, the immunoreactive peptides are from six (6) amino acids up to less than the full length EE protein, and are antigenic, i.e. are recognized by mammalian immune systems effectively. For this purpose, the peptides comprise segments that are bacterial surface exposed, rather than bacterial cytoplasmic side-exposed or embedded within the lipid bilayer membrane. Such surface exposed regions of EE proteins can be identified using computer programs using algorithms that can predict the three dimensional structure of the EE proteins based on the hydrophobicity/hydrophilicity of the amino acid regions and the repeated β sheet model.

In some embodiments, the fragments comprise a sequence that is identical to at least 6 consecutive amino acids of one or more of the variable loops depicted in Tables 2-5. In other embodiments, the fragment has a sequence that comprises at least 6 consecutive amino acids in any combination of one or more of the variable loops depicted in Tables 2-5. For example, the fragment can have a sequence from loops 1 and 2, loops 1 and 3-1, loops 1 and 3-2, loops 1 and 4, loops 2 and 3-1, loops 2 and 3-2, loops 2 and 4, loops 3-1 and 4, loops 3-2 and 4, loops 1 and 2 and 3-1, loops 1 and 2 and 3-2, loops 2 and 3-1 and 4, loops 2 and 3-2 and 4, loops 1 and 3-1 and 4, loops 1 and 3-2 and 4, loops 1 and 2 and 3-1 and 4, or loops 1 and 2 and 3-2 and 4, etc.

In some embodiments, the immunoreactive fragments (or EE peptides) include the sequences set forth in Table 6. Thus, in one embodiment, the OMP-1-1 peptide includes the sequence, SEQ ID NO: 25; the OMP-1-2 peptide includes the sequence, SEQ ID NO: 26; the OMP-1-3 peptide includes the sequence, SEQ ID NO: 27; the OMP-1-4 peptide includes the sequence, SEQ ID NO: 28; the OMP-1-5 peptide includes the sequence, SEQ ID NO: 29; the OMP-1-6 peptide includes the sequence, SEQ ID NO: 30; the OMP-1-7 peptide includes the sequence, SEQ ID NO: 31; the OMP-1-8 peptide includes the sequence, SEQ ID NO: 32; the OMP-1-9 peptide includes the sequence, SEQ ID NO: 33; the OMP-1-10 peptide includes the sequence, SEQ ID NO: 34; the OMP-1-11 peptide includes the sequence, SEQ ID NO: 35; the OMP-1-12 peptide includes the sequence, SEQ ID NO: 36; the OMP-1-13 peptide includes the sequence, SEQ ID NO: 37; the OMP-1-14 peptide includes the sequence, SEQ ID NO: 38; the OMP-1-15 peptide includes the sequence, SEQ ID NO: 39; the OMP-1-16 peptide includes the sequence, SEQ ID NO: 40; the OMP-1-17 peptide includes the sequence, SEQ ID NO: 41; the OMP-1-18 peptide includes the sequence, SEQ ID NO: 42; and the OMP-1-19 peptide includes the sequence, SEQ ID NO: 43.

TABLE 6 E. ewingii OMP-1 peptide sequences used in ELISA (see Example 1) Peptide length (amino acid SEQ position ID in the protein NO: OMP-1 ID Amino acids sequence sequence) 25 OMP-1-1 SCTEQEMKPAQQNGSSK 17 a.a. (151-167) 26 OMP 1-2 EQHFALASELDTNGNQ 16 a.a. (130-145) 27 OMP 1-3 VSAPSGYDDNIYAYFSI 17 a.a. (123-139) 28 OMP 1-4 FAIPRDTFFNNSIPY 15 a.a. (144-150) 29 OMP 1-5 NSSSLISSNNHYTQLY 16 a.a. (129-144) 30 OMP 1-6 KDNNQVQPKAHDSSSTD 17 a.a. (148-164) 31 OMP 1-7 KDPKDCSVKDAFRHL 15 a.a. (130-144) 32 OMP 1-8 TEDKYLTSEQEVNDY 15 a.a. (120-134) 33 OMP 1-9 ICKENDKPTPKEKKY 15 a.a. (145-159) 34 OMP 1-10 YRYFAIAREMNSSSNNQ 17 a.a. (137-153) 35 OMP 1-11 KNSGHSSIDAHR 12 a.a. (136-147) 36 OMP 1-12 IESDQNKFQPKNANSNS 17 a.a. (154-170) 37 OMP 1-13 SESSKEPQPKNPNSAGN 17 a.a. (159-175) 38 OMP 1-14 KYYGLFREGTPQEEEH 16 a.a. (146-161) 39 OMP 1-15 SRKDNANIGTTPQDKK 16 a.a. (141-156) 40 OMP 1-16 KIEDNQVQNKFTISNY 16 a.a. (76-91) 41 OMP 1-17 QFYREGSNNYKF 12 a.a. (123-134) 42 OMP 1-18 VQDTKSHIVDDNYR 14 a.a. (121-144) 43 OMP-1-19 SKQDNLNSDYVTLIN 15 a.a. (171-185)

Also provided herein are fusion proteins in which a tag or one or more amino acids from a heterologous protein are added to the amino or carboxy terminus of the amino acid sequence of an EE protein or a functional derivative thereof. At least one of the proteins or peptides can be in a multimeric form. As used herein, the term “heterologous protein” means a protein derived from a source other than the E. ewingii omp-1 gene, operationally linked to a E. ewingii protein or a functional derivative thereof, as disclosed in the present specification, to form a chimeric or fusion E. ewingii protein or peptide. Typically, such additions are made to stabilize the resulting fusion protein or to simplify purification of an expressed recombinant form of the corresponding EE protein, variant, or peptide. Such tags are known in the art. Representative examples of such tags include sequences which encode a series of histidine residues, the Herpes simplex glycoprotein D, or glutathione S-transferase. Such a chimeric or fusion protein can have a variety of lengths including, but not limited to, a length of at most 100 residues, at most 200 residues, at most 300 residues, at most 400 residues, at most 500 residues, at most 800 residues or at most 1000 residues. Non-limiting examples of chimeric E. ewingii proteins include fusions of E. ewingii OMPs, or variants, or peptides: with immunogenic polypeptides, such as flagellin and cholera enterotoxin; with immunomodulatory polypeptides, such as IL-2 and B7-1; with tolerogenic polypeptides; with another E. ewingii OMP, or variant, or peptide; and with synthetic sequences. Other examples include linking the EE protein, or variant or peptide with an indicator reagent, an amino acid spacer, an amino acid linker, a signal sequence, a stop transfer sequence, a transmembrane domain, a protein purification ligand or a combination of thereof. The fusion proteins can have similar or substantially similar immunoreactivity to EE antibodies as the EE proteins from which they derive.

The EE polypeptides of the present invention can be used in a variety of procedures and methods, such as for the generation of antibodies, immunogenic compositions and vaccines; for use in identifying pharmaceutical compositions; for studying DNA/protein interaction; as well as for diagnostic and screening methods.

Also provided are compositions of matter comprising one or more EE proteins, their functional derivatives and/or EE fusion proteins. The isolated or purified polypeptide in such compositions can be in a multimeric form and can further include a carrier. The purified polypeptide can be linked to an indicator reagent, an amino acid spacer, an amino acid linker, a signal sequence, a stop transfer sequence, a transmembrane domain, a protein purification ligand, or a combination of these. Alternatively, one or more EE proteins or peptides may be linked together.

Isolated Polynucleotides Coding for EE Polypeptides

The present invention also provides isolated polynucleotides as follows:

(a) a polynucleotide sequence encoding an immature EE protein OMP-1-1, OMP-1-2, OMP-1-3, OMP-1-4, OMP-1-5, OMP-1-6, OMP-1-7, OMP-1-8, OMP-1-9, OMP-1-10, OMP-1-11, OMP-1-12, OMP-1-13, OMP-1-14, OMP-1-15, OMP-1-16, OMP-1-17, OMP-1-18, or OMP-1-19, as described above;

(b) a polynucleotide sequence encoding a mature EE protein OMP-1-1, OMP-1-2, OMP-1-3, OMP-1-4, OMP-1-5, OMP-1-6, OMP-1-7, OMP-1-8, OMP-1-9, OMP-1-10, OMP-1-11, OMP-1-12, OMP-1-13, OMP-1-14, OMP-1-15, OMP-1-16, OMP-1-17, OMP-1-18, or OMP-1-19, as described above;

(c) a polynucleotide sequence encoding a functional derivative of an EE protein of (a) or (b) above;

(d) a polynucleotide sequence encoding a fusion protein of an EE protein or functional derivative of an EE protein;

(e) a polynucleotide sequence complementary to any of the nucleotide sequences in (a), (b), (c) or (d); and

(f) a polynucleotide sequence that hybridizes under highly stringent hybridization conditions with any of the nucleotide sequences in (a), (b), (c), (d) or (e).

The EE protein encoded by the polynucleotide of the invention includes: (1) a mature OMP-1-1 protein encoded by nucleotide 672-1484 of SEQ ID NO: 1; (2) a mature OMP-1-2 protein encoded by nucleotide 2116-2871 of SEQ ID NO: 1; (3) a mature OMP-1-3 protein encoded by nucleotide 3610-4395 of SEQ ID NO: 1; (4) a mature OMP-1-4 protein encoded by nucleotide 4486-5286 of SEQ ID NO: 1; (5) a mature OMP-1-5 protein encoded by nucleotide 5380-6129 of SEQ ID NO: 1; (6) a mature OMP-1-6 protein encoded by nucleotide 6216-7040 of SEQ ID NO: 1; (7) a mature OMP-1-7 protein encoded by nucleotide 7145-7921 of SEQ ID NO: 1; (8) a mature OMP-1-8 protein encoded by nucleotide 8032-8679 of SEQ ID NO: 1; (9) a mature OMP-1-9 protein encoded by nucleotide 8772-9536 of SEQ ID NO: 1; (10) a mature OMP-1-10 protein encoded by nucleotide 9620-10387 of SEQ ID NO: 1; (11) a mature OMP-1-11 protein encoded by nucleotide 10477-11268 of SEQ ID NO: 1; (12) a mature OMP-1-12 protein encoded by nucleotide 11370-12188 of SEQ ID NO: 1; (13) a mature OMP-1-13 protein encoded by nucleotide 12292-13113 of SEQ ID NO: 1; (14) a mature OMP-1-14 protein encoded by nucleotide 14530-15312 of SEQ ID NO: 1; (15) a mature OMP-1-15 protein encoded by nucleotide 15689-16450 of SEQ ID NO: 1; (16) a mature OMP-1-16 protein encoded by nucleotide 16861-17634 of SEQ ID NO: 1; (17) a mature OMP-1-17 protein encoded by nucleotide 18479-19222 of SEQ ID NO: 1; (18) a mature OMP-1-18 protein encoded by nucleotide 19558-20310 of SEQ ID NO: 1; and (19) a mature OMP-1-19 protein encoded by nucleotide 21188-21967 of SEQ ID NO: 1.

The functional derivative encoded by the polynucleotide should not have the sequence SEQ ID NO: 128, 130, 132, 134, 136; or any fragment thereof, and each functional derivative should have a specific binding affinity for an anti-E. ewingii antibody.

In some embodiments, the functional derivative encoded by the polynucleotide comprises a sequence which is at least 85%, 90%, 95% or 98% identical to the sequence of the mature OMP-1 protein (1)-(19), as described above.

In some embodiments, the functional derivative encoded by the polynucleotide comprises an immunoreactive fragment that has a length of from 6 amino acids to less than the full length of the EE protein and comprises 6 or more consecutive amino acids from the following sequences: (1) SEQ ID NO: 137-155; (2) SEQ ID NO: 156-173; (3) SEQ ID NO: 174-191; (4) SEQ ID NO: 192-208; (5) SEQ ID NO: 209-227; or (6) any combination of the sequences (1)-(5); wherein each immunoreactive fragment has a specific binding affinity for an anti-E. ewingii antibody.

In other embodiments, the EE protein encoded by the polynucleotide comprises a sequence selected from the group consisting of: (1) SEQ ID NO: 3 corresponding to an immature OMP-1-1 protein; (2) SEQ ID NO: 4 corresponding to an immature OMP-1-2 protein; (3) SEQ ID NO: 6 corresponding to an immature OMP-1-3 protein; (4) SEQ ID NO: 7 corresponding to an immature OMP-1-4 protein; (5) SEQ ID NO: 8 corresponding to an immature OMP-1-5 protein; (6) SEQ ID NO: 9 corresponding to an immature OMP-1-6 protein; (7) SEQ ID NO: 10 corresponding to an immature OMP-1-7 protein; (8) SEQ ID NO: 11 corresponding to an immature OMP-1-8 protein; (9) SEQ ID NO: 12 corresponding to an immature OMP-1-9 protein; (10) SEQ ID NO: 13 corresponding to an immature OMP-1-10 protein; (11) SEQ ID NO: 14 corresponding to an immature OMP-1-11 protein; (12) SEQ ID NO: 15 corresponding to an immature OMP-1-12 protein; (13) SEQ ID NO: 16 corresponding to an immature OMP-1-13 protein; (14) SEQ ID NO: 17 corresponding to an immature OMP-1-14 protein; (15) SEQ ID NO:18 corresponding to an immature OMP-1-15 protein; (16) SEQ ID NO: 19 corresponding to an immature OMP-1-16; (17) SEQ ID NO: 20 corresponding to an immature OMP-1-17 protein; (18) SEQ ID NO: 21 corresponding to an immature OMP-1-18 protein; (19) SEQ ID NO: 22 corresponding to an immature OMP-1-19 protein.

In one embodiment, the polynucleotide comprises a portion of the nucleotide sequence SEQ ID NO: 1.

It is understood that the polynucleotides encoding the EE polypeptides can have a different sequence than the nucleotide sequence shown in Table 1 due to the degeneracy of the genetic code. Thus, also included within the scope of this invention are the functional equivalents of the herein-described isolated polynucleotides and derivatives thereof. For example, the nucleic acid sequences depicted in SEQ ID NO: 1 can be altered by substitutions, additions or deletions that provide for functionally equivalent molecules. Due to the degeneracy of nucleotide coding sequences, other DNA sequences which encode substantially the same amino acid sequences as depicted in Tables 1-6 can be used in the practice of the present invention. These include but are not limited to nucleotide sequences comprising all or portions of SEQ ID NO: 1 that encode for EE polypeptides, which are altered by the substitution of different codons that encode a functionally equivalent amino acid residue within the sequence.

In addition, the polynucleotide can comprise a nucleotide sequence which results from the addition, deletion or substitution of at least one nucleotide to the 5′-end and/or the 3′-end of the nucleic acid segments of SEQ ID NO:1, or a derivative thereof. Any polynucleotide can be used in this regard, provided that its addition, deletion or substitution does not substantially alter the amino acid sequence of the EE protein, or functional derivatives or fusion proteins thereof, encoded by the polynucleotide sequence. Moreover, the polynucleotide of the present invention can, as necessary, have restriction endonuclease recognition sites added to its 5′-end and/or 3′-end. All variations of the nucleotide sequence of the EE omp-1 gene and fragments thereof permitted by the genetic code are, therefore, included in this invention.

Further, it is possible to delete codons or to substitute one or more codons by codons other than degenerate codons to produce a structurally modified polypeptide, but one which has substantially the same utility or activity of the polypeptide produced by the unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are functionally equivalent, as are the two nucleic acid molecules which give rise to their production, even though the differences between the nucleic acid molecules are not related to degeneracy of the genetic code.

Finally, the isolated polynucleotides of the invention expressly exclude any polynucleotide that encodes for a peptide that is the 505-bp E. ewingii p28-1 peptide deposited in GenBank accession numbers: AF287961 (SEQ ID NOS 127-128), AF287962 (SEQ ID NOS 129-130), AF287963 (SEQ ID NOS 131-132), AF287964 (SEQ ID NOS 133-134, AF287966 (SEQ ID NOS 135-136); or any fragment thereof.

Isolation of Polynucleotides

The isolated EE polynucleotides coding for EE polypeptides can be isolated from a biological sample (e.g. of mammalian or tick origin) containing EE RNA or DNA.

The polynucleotide can be isolated from a biological sample containing EE RNA using the techniques of cDNA cloning and subtractive hybridization. The polynucleotide can also be isolated from a cDNA library using a homologous probe, i.e., a probe comprising sequences substantially identical or complementary to portions or all of the polynucleotide sequence SEQ ID NO: 1; or with antibodies immunospecific for an EE OMP protein or peptide to identify clones containing such polynucleotides.

The polynucleotide can be isolated from a biological sample containing genomic DNA or from a genomic library. Suitable biological samples include, but are not limited to, whole organisms, organs, tissues, blood and cells. The method of obtaining the biological sample will vary depending upon the nature of the sample.

One skilled in the art will realize that EE polynucleotides may also be isolated from a number of infected eukaryotes (for example, mammals, birds, fish and humans) that may contain the EE protein genes.

Synthesis of Polynucleotides

Isolated polynucleotides of the present invention include those chemically synthesized. For example, a polynucleotide with the nucleotide sequence which codes for the expression product of an EE protein gene can be designed and, if necessary, divided into appropriate smaller fragments. Then an oligomer which corresponds to the polynucleotide, or to each of the divided fragments, can be synthesized. Such synthetic oligonucleotides can be prepared, for example, by the triester method of Matteucci et al., J. Am. Chem. Soc. 103:3185-3191 (1981) or by using an automated DNA synthesizer.

An oligonucleotide can be derived synthetically or by cloning. If necessary, the 5′-ends of the oligomers can be phosphorylated using T4 polynucleotide kinase. Kinasing of single strands prior to annealing or for labeling can be achieved using an excess of the enzyme. If kinasing is for the labeling of probe, the ATP can contain high specific activity radioisotopes. Then, the DNA oligomer can be subjected to annealing and ligation with T4 ligase or the like.

Also encompassed by the present invention, are single stranded polynucleotides, hereinafter referred to as antisense polynucleotides, having sequences which are complementary to the DNA and RNA sequences which encode the EE proteins and peptides, or their functional variants.

DNA Constructs Comprising a Polynucleotide Encoding an Ee Polypeptide, and Cells Containing These Constructs

The EE polynucleotides described herein are useful for producing the EE polypeptides. For example, an RNA molecule encoding an EE polypeptide can be used in a cell-free translation system to prepare an isolated polypeptide corresponding to an EE protein, its functional derivatives or fusion proteins. Alternatively, a DNA molecule encoding an EE polypeptide can be introduced into an expression vector and used to transform cells.

Accordingly, in another aspect, the present invention relates to a recombinant DNA molecule comprising, 5′ to 3′, a promoter effective to initiate transcription in a host cell and one or more of the above-described EE polynucleotides. In another embodiment, the present invention relates to a recombinant DNA molecule comprising a vector and an above-described polynucleotide.

In another embodiment, the present invention relates to a nucleic acid molecule comprising a transcriptional control region functional in a cell, a sequence complimentary to an RNA sequence encoding an amino acid sequence corresponding to an above-described EE polypeptide, and a transcriptional termination region functional in the cell.

In one embodiment, the above-described molecules are isolated and/or purified DNA molecules.

In another embodiment, the present invention relates to a cell or non-human organism that has been altered to express the EE polypeptides and so contains one or more of the above-described nucleic acid molecules.

In another embodiment, the polypeptide is purified from cells which have been altered to express the polypeptide.

As used herein, a cell, or organism, is said to be “altered to express a desired polypeptide” when the cell, or organism, through genetic manipulation, is made to produce a protein which it normally does not produce or which it normally produces at low levels. One skilled in the art can readily adapt procedures for introducing and expressing either genomic, cDNA, or synthetic sequences into either eukaryotic or prokaryotic cells.

A nucleic acid molecule, such as DNA, is said to be “capable of expressing” a polypeptide if it contains nucleotide sequences which contain transcriptional and translational regulatory information and such sequences are “operably linked” to nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in which the regulatory DNA sequences and the DNA sequence sought to be expressed are connected in such a way as to permit gene sequence expression. The precise nature of the regulatory regions needed for gene sequence expression can vary from organism to organism, but shall in general include a promoter region which, in prokaryotes, contains both the promoter (which directs the initiation of RNA transcription) as well as the DNA sequences which, when transcribed into RNA, will signal synthesis initiation. Such regions will normally include those 5′-non-coding sequences involved with initiation of transcription and translation, such as the TATA box, capping sequence, CAAT sequence, and the like.

If desired, the non-coding region 3′ to the EE polypeptide coding sequence can be obtained by the above-described methods. This region can be retained for its transcriptional termination regulatory sequences, such as termination and polyadenylation. Thus, by retaining the 3′-region naturally contiguous to the DNA sequence encoding an EE protein gene, the transcriptional termination signals can be provided. Where the transcriptional termination signals are not satisfactorily functional in the expression host cell, then a 3′ region functional in the host cell can be substituted.

Two DNA sequences (such as a promoter region sequence and an EE polypeptide coding sequence) are said to be operably linked if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region sequence to direct the transcription of an EE polypeptide coding sequence, or (3) interfere with the ability of the EE polypeptide coding sequence to be transcribed by the promoter region sequence. Thus, a promoter region would be operably linked to a DNA sequence if the promoter were capable of effecting transcription of that DNA sequence.

The present invention encompasses the expression of the EE polypeptide coding sequence in either prokaryotic or eukaryotic cells. Prokaryotic hosts are, generally, the most efficient and convenient for the production of recombinant proteins and, therefore, are suitable for the expression of the EE polypeptide coding sequence.

Prokaryotes most frequently are represented by various strains of E. coli. However, other microbial strains can also be used, including other bacterial strains. In prokaryotic systems, plasmid vectors that contain replication sites and control sequences derived from a species compatible with the host can be used. Examples of suitable plasmid vectors include, but are not limited to, pBR322, pUC18, pUC19, pUC118, pUC119 and the like; suitable phage or bacteriophage vectors include λgt10, λgt11 and the like; and suitable virus vectors include pMAM-neo, pKRC and the like. In some examples, the selected vector of the present invention has the capacity to replicate in the selected host cell.

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such conditions, the peptide will not be glycosylated. The prokaryotic host must be compatible with the replicon and control sequences in the expression plasmid.

To express EE polypeptides in a prokaryotic cell, it is necessary to operably link the EE protein coding sequence to a functional prokaryotic promoter. Such promoters can be either constitutive or regulatable (i.e., inducible or derepressible). Examples of constitutive promoters include, but are not limited to, the int promoter of bacteriophage λ, the bla promoter of the β-lactamase gene sequence of pBR322, and the CAT promoter of the chloramphenicol acetyl transferase gene sequence of pBR325, and the like. Examples of inducible prokaryotic promoters include, but are not limited to, the major right and left promoters of bacteriophage λ (P_(L) and P_(R)), the trp, recA, lacZ, lacI, and gal promoters of E. coli, the α-amylase (Ulmanen et al., J. Bacteriol. 162:176-182 (1985)) and the ξ-28-specific promoters of B. subtilis (Gilman et al., Gene sequence 32:11-20 (1984)), the promoters of the bacteriophages of Bacillus (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY (1982)), and Streptomyces promoters (Ward et al., Mol. Gen. Genet. 203:468-478 (1986)). Prokaryotic promoters are reviewed by Glick (J. Ind. Microbiol. 1:277-282 (1987)); Cenatiempo (Biochimie 68:505-516 (1986)); and Gottesman (Ann. Rev. Genet. 18:415-442 (1984)).

Proper expression in a prokaryotic cell may also require the presence of a ribosome binding site upstream of the gene sequence-encoding sequence. Such ribosome binding sites are disclosed, for example, by Gold et al (Ann. Rev. Microbiol. 35:365-404 (1981)).

The selection of control sequences, expression vectors, transformation methods, and the like, are dependent on the type of host cell used to express the gene. As used herein, “cell”, “cell line”, and “cell culture” can be used interchangeably and all such designations include progeny. Thus, the words “transformants” or “transformed cells” include the primary subject cell and cultures derived therefrom, without regard to the number of transfers. It is also understood that all progeny can not be precisely identical in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant progeny have the same functionality as that of the originally transformed cell. Host cells which can be used in the expression systems of the present invention are not strictly limited, provided that they are suitable for use in the expression of the EE polypeptides of interest. Suitable hosts include eukaryotic cells.

Some examples of suitable eukaryotic hosts include, but are not limited to, yeast, fungi, insect cells, mammalian cells either in vivo, or in tissue culture. Suitable mammalian cells include HeLa cells, cells of fibroblast origin such as VERO or CHO-K1, or cells of lymphoid origin and their derivatives.

In addition, plant cells are also available as hosts, and control sequences compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 19S, and nopaline synthase promoter and polyadenylation signal sequences.

Another type of host is an insect cell, for example Drosophila larvae. Using insect cells as hosts, the Drosophila alcohol dehydrogenase promoter can be used, Rubin, Science 240:1453-1459 (1988). Alternatively, baculovirus vectors can be engineered to express large amounts of EE polypeptides in insect cells (Jasny, Science 238:1653 (1987); Miller et al., In: Genetic Engineering (1986), Setlow, J. K., et al., eds., Plenum, Vol. 8, pp. 277-297).

Different host cells have characteristic and specific mechanisms for the translational and post-translational processing and modification (e.g., glycosylation, cleavage) of proteins. Appropriate cell lines or host systems can be chosen to ensure the desired modification and processing of the foreign protein expressed.

Any of a series of yeast gene sequence expression systems can be utilized which incorporate promoter and termination elements from the actively expressed gene sequences coding for glycolytic enzymes. These enzymes are produced in large quantities when yeast are grown in mediums rich in glucose. Known glycolytic gene sequences can also provide very efficient transcriptional control signals.

Yeast can provide substantial advantages in that it can also carry out post-translational peptide modifications. A number of recombinant DNA strategies exist which utilize strong promoter sequences and high copy number of plasmids which can be utilized for production of the desired proteins in yeast. Yeast recognizes leader sequences on cloned mammalian gene sequence products and secretes peptides bearing leader sequences (i.e., pre-peptides). For a mammalian host, several possible vector systems are available for the expression of EE polypeptides.

A wide variety of transcriptional and translational regulatory sequences can be employed, depending upon the nature of the host. The transcriptional and translational regulatory signals can be derived from viral sources, such as adenovirus, bovine papilloma virus, simian virus, or the like, where the regulatory signals are associated with a particular gene sequence which has a high level of expression. Alternatively, promoters from mammalian expression products, such as actin, collagen, myosin, and the like, can be employed. Transcriptional initiation regulatory signals can be selected which allow for repression or activation, so that expression of the gene sequences can be modulated. Of interest are regulatory signals which are temperature-sensitive so that by varying the temperature, expression can be repressed or initiated, or are subject to chemical (such as metabolite) regulation.

As discussed above, expression of EE polypeptides in eukaryotic hosts requires the use of eukaryotic regulatory regions. Such regions will, in general, include a promoter region sufficient to direct the initiation of RNA synthesis. Examples of eukaryotic promoters include, but are not limited to, the promoter of the mouse metallothionein I gene sequence (Hamer et al., J. Mol. Appl. Gen. 1:273-288 (1982)); the TK promoter of Herpes virus (McKnight, Cell 31:355-365 (1982)); the SV40 early promoter (Benoist et al., Nature (London) 290:304-310 (1981)); the yeast gal4 gene sequence promoter (Johnston et al., Proc. Natl. Acad. Sci. (USA) 79:6971-6975 (1982); Silver et al., Proc. Natl. Acad. Sci. (USA) 81:5951-5955 (1984)) and the CMV immediate-early gene promoter (Thomsen et al., Proc. Natl. Acad. Sci (USA) 81:659-663 (1984)).

As is widely known, translation of eukaryotic mRNA is initiated at the codon which encodes the first methionine. For this reason, it may be desirable to ensure that the linkage between a eukaryotic promoter and an EE polypeptide coding sequence does not contain any intervening codons which are capable of encoding a methionine (i.e., AUG). The presence of such codons results either in a formation of a fusion protein (if the AUG codon is in the same reading frame as the EE polypeptide coding sequence) or a frame-shift mutation (if the AUG codon is not in the same reading frame as the EE polypeptide coding sequence).

A nucleic acid molecule encoding an EE polypeptide and an operably linked promoter can be introduced into a recipient prokaryotic or eukaryotic cell either as a non-replicating DNA (or RNA) molecule, which can either be a linear molecule or a closed covalent circular molecule. Since such molecules are incapable of autonomous replication, the expression of the gene can occur through the transient expression of the introduced sequence. Alternatively, permanent expression can occur through the integration of the introduced DNA sequence into the host chromosome.

In one embodiment, a vector is employed which is capable of integrating the desired gene sequences into the host cell chromosome. Cells which have stably integrated the introduced DNA into their chromosomes can be selected by also introducing one or more markers which allow for selection of host cells which contain the expression vector. The marker can provide for prototrophy to an auxotrophic host, biocide resistance, e.g., antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene sequence can either be directly linked to the DNA gene sequences to be expressed, or introduced into the same cell by co-transfection. Additional elements can also be needed for optimal synthesis of single chain binding protein mRNA. These elements can include splice signals, as well as transcription promoters, enhancer signal sequences, and termination signals. cDNA expression vectors incorporating such elements include those described by Okayama, Molec. Cell. Biol. 3:280 (1983).

In some embodiments, the introduced nucleic acid molecule will be incorporated into a plasmid or viral vector capable of autonomous replication in the recipient host. Any of a wide variety of vectors can be employed for this purpose. Factors of importance in selecting a particular plasmid or viral vector include: the ease with which recipient cells that contain the vector can be recognized and selected from those recipient cells which do not contain the vector; the number of copies of the vector which are desired in a particular host; and whether it is desirable to be able to “shuttle” the vector between host cells of different species.

Examples of prokaryotic vectors include, but are not limited to, plasmids such as those capable of replication in E. coli (such as, for example, pBR322, ColE1, pSC101, pACYC 184, πVX. Such plasmids are, for example, disclosed by Sambrook (Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989). Bacillus plasmids include pC194, pC221, pT127, and the like. Such plasmids are disclosed by Gryczan (In: The Molecular Biology of the Bacilli, Academic Press, NY (1982), pp. 307-329). Suitable Streptomyces plasmids include pIJ101 (Kendall et al., J. Bacteriol. 169:4177-4183 (1987)), and streptomyces bacteriophages such as φC31 (Chater et al., In: Sixth International Symposium on Actinomycetales Biology, Akaderniai Kaido, Budapest, Hungary (1986), pp. 45-54). Pseudomonas plasmids are reviewed by John et al (Rev. Infect. Dis. 8:693-704 (1986)), and Izaki (Jpn. J. Bacteriol. 33:729-742 (1978)).

Examples of suitable eukaryotic plasmids include, but are not limited to, BPV, vaccinia, SV40, 2-micron circle, and the like, or their derivatives. Such plasmids are well known in the art (Botstein et al., Miami Wntr. Symp. 19:265-274 (1982); Broach, In: The Molecular Biology of the Yeast Saccharomyces: Life Cycle and Inheritance, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., p. 445-470 (1981); Broach, Cell 28:203-204 (1982); Bollon et al., J. Clin. Hematol. Oncol. 10:39-48 (1980); Maniatis, In: Cell Biology: A Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 563-608 (1980)).

Once the vector or nucleic acid molecule containing the construct(s) has been prepared for expression, the DNA construct(s) can be introduced into an appropriate host cell by any of a variety of suitable means, e.g., transformation, transfection, conjugation, protoplast fusion, electroporation, particle gun technology, calcium phosphate-precipitation, direct microinjection, and the like. After the introduction of the vector, recipient cells are grown in a selective medium, which selects for the growth of vector-containing cells. Expression of the cloned gene molecule(s) results in the production of EE polypeptide. This can take place in the transformed cells as such, or following the induction of these cells to differentiate (for example, by administration of bromodeoxyuracil to neuroblastoma cells or the like).

Nucleic Acid Probes and Primers for the Specific Detection of E. ewingii

The EE polynucleotides described herein are also useful for designing hybridization probes for isolating and identifying cDNA clones and genomic clones encoding the EE proteins, peptides or allelic forms thereof. Such hybridization techniques are known to those of skill in the art.

Therefore, in another embodiment, a nucleic acid probe is provided for the specific detection of the presence of one or more EE polynucleotides in a sample comprising the above-described isolated polynucleotides or at least a fragment thereof, which binds under stringent conditions, or highly stringent conditions, to EE polynucleotides.

The term “stringent conditions” as used herein is the binding which occurs within a range from about Tm 5° C. (5° C. below the melting temperature Tm of the probe) to about 200 C. to 25° C. below Tm. The term “highly stringent hybridization conditions” as used herein refers to conditions of: at least about 6×SSC and 1% SDS at 65° C., with a first wash for 10 minutes at about 42° C. with about 20% (v/v) formamide in 0.1×SSC, and with a subsequent wash with 0.2×SSC and 0.1% SDS at 65° C.

In one embodiment, the isolated nucleic acid probe consisting of 10 to 1000 nucleotides (for example: 10 to 500, 10 to 250, 10 to 100, 10 to 50, 10 to 35, 20 to 1000, 20 to 500, 20 to 250, 20 to 100, 20 to 50, or 20 to 35, etc.) which hybridizes preferentially to RNA or DNA of EE but not to RNA or DNA of non-EE organisms, wherein said nucleic acid probe is or is complementary to a nucleotide sequence consisting of at least 10 consecutive nucleotides, or 15, 20, 25, 30, 50, 100, 250, 500, 600, 700, 800, or 900 consecutive nucleotides, or along the entire length, of one or more of the EE polynucleotides described above.

In some embodiments, the nucleic acid probe comprises a polynucleotide sequence encoding a polypeptide that corresponds to one or more of the variable loop sequences in Tables 2-6. Such probes would hybridize with a specific polynucleotide encoding a polypeptide corresponding to a variable sequence in each EE OMP protein and so will be specific to EE, as opposed to the other ehrlichia species. Methods for designing probes that are specific for EE polynucleotide sequences based on the sequence alignment provided in FIG. 7 are well known in the art.

In other embodiments, the nucleic acid probe can be used to probe an appropriate chromosomal or cDNA library by usual hybridization methods to obtain another nucleic acid molecule of the present invention. A chromosomal DNA or cDNA library can be prepared from appropriate cells according to recognized methods in the art (see e.g. Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989).

Such hybridization probes can have a sequence which is at least 90%, 95%, 98%, 99% or 100% complementary with a sequence contained within the sense strand of a DNA molecule which encodes each of the EE proteins or with a sequence contained within its corresponding antisense strand. Such hybridization probes bind to the sense or antisense strand under stringent, or highly stringent, conditions.

The term “stringent conditions” as used herein is the binding which occurs within a range from about Tm 5° C. (5° C. below the melting temperature Tm of the probe) to about 20° C. to 25° C. below Tm. The term “highly stringent hybridization conditions” as used herein refers to conditions of: at least about 6×SSC and 1% SDS at 65° C., with a first wash for 10 minutes at about 42° C. with about 20% (v/v) formamide in 0.1×SSC, and with a subsequent wash with 0.2×SSC and 0.1% SDS at 65° C.

The probes can be used in a number of assays, such as Northern assays to detect transcripts of EE omp-1 homologous genes and in Southern assays to detect EE omp-1 homologous genes. In some examples, the identity of probes which are e.g. 200 nucleotides in length and have full complementarity with a portion of the antisense strand of a double-stranded DNA molecule which encodes the EE proteins can be determined using EE protein-encoding segments of the nucleotide sequence, SEQ ID NO: 1.

The hybridization probes of the present invention can be labeled by standard labeling techniques such as with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, and the like. After hybridization, the probes can be visualized using known methods.

The nucleic acid probes of the present invention include RNA, as well as DNA probes, such probes being generated using techniques known in the art.

In one embodiment of the above described method, a nucleic acid probe is immobilized on a solid support. Examples of such solid supports include, but are not limited to, plastics such as polycarbonate, complex carbohydrates such as agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. Techniques for coupling nucleic acid probes to such solid supports are well known in the art.

The EE polynucleotides disclosed herein are also useful for designing primers for polymerase chain reaction (PCR), a technique useful for obtaining large quantities of cDNA molecules that encode the EE polypeptides. PCR primers can also be used for diagnostic purposes.

Thus, also included are oligonucleotides that are used as primers in polymerase chain reaction (PCR) technologies to amplify transcripts of the genes which encode the EE polypeptides, or portions of such transcripts. In some examples, the primers comprise a minimum of about 12 to 15 nucleotides and a maximum of about 30 to 35 nucleotides. The primers can have a G+C content of 40% or greater. Such oligonucleotides are at least 98% complementary with a portion of the DNA strand, i.e., the sense strand, which encodes the EE protein, or a portion of its corresponding antisense strand. In some embodiments, the primer has at least 99% complementarity, or 100% complementarity, with such sense strand or its corresponding antisense strand. Primers which have 100% complementarity with the antisense strand of a double-stranded DNA molecule encoding an EE protein have a sequence which is identical to a sequence contained within the sense strand.

One skilled in the art can readily design such probes based on the sequences disclosed herein using methods of computer alignment and sequence analysis known in the art (see, for example, Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989). For example, the identity of primers which are 15 nucleotides in length and have full complementarity with a portion of the antisense strand of a double-stranded DNA molecule which encodes the EE proteins can be determined using the EE protein segments of the nucleotide sequence, SEQ ID NO: 1.

In some embodiments, the primers have one or more of the following features: (i) the 3′ end of the primer set is either C or G; (ii) at least the 3′ end of each primer has the same sequence as the target DNA (or its reverse complement), i.e. the 5′ end can have variations and still produce a proper PCR product; (iii) the primer is about 20 bp long; (iv) both members of a primer set are around the same length; (v) the GC total and AT total for both primers in a set are the same, or very similar; (vi) the primer should be designed after performing a BLAST search on each primer to determine if there could be alignment for amplification of an unwanted species; and/or (vii) use the formula 4×(G+C)+2×(A+T)=PCR Annealing Temperature (±5° C.) for a primer set. For example, Ohashi N, et al. (2001) Infection and Immunity, 69:2083-91, the entire contents of which are incorporated herein by reference, describes multiple primers designed to detect all 22 p30 proteins of E. canis. A similar strategy can be used for E. ewingii primer design.

In one embodiment, the primers are designed to amplify a target DNA segment that is useful for diagnostic purposes. The target sequence can have a length of 50-300 nucleotide bases, 50-200 bases, 80-200 bases, 100-300, 300-600, 600-800, 600-1000, or 800-1000 bases. In some embodiments, the target DNA comprises a polynucleotide sequence encoding a polypeptide comprising one or more of the variable loop sequences in Tables 2-6. Such primers would amplify a specific polynucleotide segment encoding a polypeptide whose sequence corresponds to a variable sequence in each EE OMP protein and so will be specific to EE, as opposed to other ehrlichia species. Methods for designing primers that are specific for EE polynucleotide sequences based on the sequence alignment provided in FIGS. 7 and 8 are well known in the art.

In another embodiments, the target DNA segment has a nucleotide sequence that encodes a polypeptide that is conserved among E. ewingii strains, but is distinct from other Ehrlichia strains. Examples of conserved sequences among E. ewingii strains are shown as darkened areas in the attached sequence alignment (FIGS. 7 and 8). Using as target DNA sequences that encode for conserved regions of EE proteins increases the sensitivity of the PCR reaction because there are more than two copies of the target sequence in the genome. This increases the PCR's sensitivity more than two fold. For example, U.S. Pat. No. 6,432,649 to Stich et al., the entire contents of which is incorporated herein by reference, describes methods of designing such primers based on sequence alignments of E. canis and E. chaffeensis for the specific diagnosis of each of these species. A similar method can be employed to design optimal primer sets for E. ewingii diagnosis.

In other embodiments, the primers are designed for nested PCR, or target more than two regions of the target sequence to increase sensitivity. Nested PCR is a conventional PCR with a second round of amplification using a different set of primers. This second set of primers is specific to a sequence found within the target DNA of the initial conventional PCR amplicon. The use of a second amplification step with the “nested” primer set results in a reduced background from products amplified during the initial PCR due to the nested primers' additional specificity to the region. The amount of amplicon produced is increased as a result of the second round of amplification and due to a reduction in any inhibitor concentrations. For example, the first set of primers can target variable loops 1 and 4, and the second nested primer set can target variable loops 2 or 3.

Primer design choices that can increase the PCR reaction's specificity include using primers that border the target DNA sequence together with higher annealing temperatures.

The term “primer” as used herein refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product complementary to a nucleic acid strand is induced. The conditions include the presence of nucleotides and an inducing agent such as a DNA polymerase and a suitable temperature and pH. The primer may be either single-stranded or double-stranded and must be sufficiently long to prime the synthesis of the desired extension product in the presence of the inducing agent. The exact length of the primer will depend upon many factors, including temperature, source of primer and the method used. For example, for diagnostic applications, the oligonucleotide primer typically contains 15-30 or more nucleotides depending on the complexity of the target sequence. Primers with fewer nucleotides may also be used.

The primers herein are selected to be “substantially” complementary to different strands of a particular target DNA sequence. This means that the primers must be sufficiently complementary to hybridize with their respective strands. Therefore, the primer sequence need not reflect the exact sequence of the template. For example, a non-complementary nucleotide fragment may be attached to the 5′ end of the primer, with the remainder of the primer sequence being complementary to the strand. Alternatively, non-complementary bases or longer sequences can be interspersed into the primer, provided that the primer sequence has sufficient complementary with the sequence or hybridize therewith and thereby form the template for the synthesis of the extension product.

Antibodies

Also contemplated herein is an isolated or purified antibody having specific binding affinity to an EE polypeptide as described above.

An antibody that has a “specific binding affinity” to an EE polypeptide, is an antibody that binds with a substantially greater affinity to the EE polypeptide, than to an E. canis or E. chaffeensis protein.

Any of a variety of routine assays can be used for detecting antigen-antibody complexes, the presence of which is an indicator of selective binding. Such assays include, without limitation, enzyme-linked immunosorbent assays (ELISA), radioimmunoassays, western blotting, enzyme immunoassays, fluorescence immunoassays, luminescent immunoassays and the like. Methods for detecting a complex between a peptide and an antibody, and thereby identifying an antibody with specific binding affinity to an EE polypeptide are well known to those skilled in the art and are described, for example, in ANTIBODIES: A LABORATORY MANUAL (Edward Harlow & David Lane, eds., Cold Spring Harbor Laboratory Press, 2.sup.nd ed. 1998a); and USING ANTIBODIES: A LABORATORY MANUAL: PORTABLE PROTOCOL No. I (Edward Harlow & David Lane, Cold Spring Harbor Laboratory Press, 1998b), which are hereby incorporated by reference in their entirety.

In one embodiment, the “specific binding affinity” of an antibody is defined as an ELISA assay result, where the ratio of E. chaffeensis or E. canis polypeptide reactivity/control plasma reactivity is ˜1.00, and where E. ewingii polypeptide reactivity yields an OD_(405nm)-OD_(492nm) value greater than the mean OD_(405nm)-OD_(492nm) of preinfection control plasma+three standard deviations. In this example, the antibodies can be obtained from a subject infected with E. ewingii. The control plasma can include preinfection plasma, plasma from subjects infected with anything other than E. ewingii, or plasma from subjects infected with E. chaffeensis or E. canis.

The EE polypeptides can be used to produce antibodies or hybridomas. One skilled in the art will recognize that if an antibody is desired, such a polypeptide would be generated as described herein and used as an immunogen.

The produced antibodies are useful research tools for diagnostic and screening purposes, for identifying cells, such as granulocytes, infected with E. ewingii and for purifying the major outer membrane protein of E. ewingii from partially purified preparations by affinity chromatography. Such antibodies are also useful for identifying bacterial colonies, particularly colonies of genetically-engineered bacteria, that are expressing the major outer membrane protein of E. ewingii.

The antibodies described herein can also be used in a composition to be administered to a subject in need thereof, to reduce the level of E. ewingii infection in the subject. Such a reduction refers to a reduction or elimination of clinical signs and symptoms of E. ewingii infection in the subject. Alternatively the antibody can be used to prevent infection with E. ewingii in a subject.

The antibodies include monoclonal and polyclonal antibodies, as well as fragments of these antibodies. The invention further includes single chain antibodies. Antibody fragments which contain the idiotype of the molecule can be generated by known techniques. For example, such fragments include but are not limited to: the F(ab′)₂ fragment; the Fab′ fragments, Fab fragments, and Fv fragments.

In one embodiment, the antibodies to EE polypeptides are produced in humans, or are “humanized” (i.e. non-immunogenic in a human) by recombinant or other technology. Humanized antibodies can be produced, for example by replacing an immunogenic portion of an antibody with a corresponding, but non-immunogenic portion (i.e. chimeric antibodies) (Robinson, R. R. et al., International Patent Publication PCT/US86/02269; Akira, K. et al., European Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; Morrison, S. L. et al., European Patent Application 173,494; Neuberger, M. S. et al., PCT Application WO 86/01533; Cabilly, S. et al., European Patent Application 125,023; Better, M. et al., Science 240:1041-1043 (1988); Liu, A. Y. et al., Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Liu, A. Y. et al., J. Immunol. 139:3521-3526 (1987); Sun, L. K. et al., Proc. Natl. Acad. Sci. USA 84:214-218 (1987); Nishimura, Y. et al., Canc. Res. 47:999-1005 (1987); Wood, C. R. et al., Nature 314:446-449 (1985); Shaw et al., J. Natl. Cancer Inst. 80:1553-1559 (1988)). General reviews of “humanized” chimeric antibodies are provided by Morrison, S. L. (Science, 229:1202-1207 (1985)) and by Oi, V. T. et al., BioTechniques 4:214 (1986)). Suitable “humanized” antibodies can be alternatively produced by CDR or CEA substitution (Jones, P. T. et al., Nature 321:552-525 (1986); Verhoeyan et al., Science 239:1534 (1988); Beidler, C. B. et al., J. Immunol. 141:4053-4060 (1988)).

In another embodiment, the present invention relates to a hybridoma which produces the above-described monoclonal antibody. A hybridoma is an immortalized cell line which is capable of secreting a specific monoclonal antibody.

In general, techniques for preparing monoclonal antibodies and hybridomas are well known in the art (Campbell, “Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology,” Elsevier Science Publishers, Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. Methods 35:1-21 (1980)).

Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be immunized with the selected polypeptide. Methods for immunization are well known in the art. Such methods include subcutaneous or interperitoneal injection of the polypeptide. One skilled in the art will recognize that the amount of polypeptide used for immunization will vary based on the animal which is immunized, the antigenicity of the polypeptide and the site of injection.

The polypeptide can be modified or administered in an adjuvant in order to increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide are well known in the art. Such procedures include coupling the antigen with a heterologous protein (such as globulin or β-galactosidase) or through the inclusion of an adjuvant during immunization.

For monoclonal antibodies, spleen cells from the immunized animals are removed, fused with myeloma cells, and allowed to become monoclonal antibody producing hybridoma cells. Any one of a number of methods well known in the art can be used to identify the hybridoma cell which produces an antibody with the desired characteristics. These include screening the hybridomas with an ELISA assay, western blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Res. 175:109-124 (1988)). Hybridomas secreting the desired antibodies are cloned and the class and subclass is determined using procedures known in the art (Campbell, Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology, supra (1984)).

For polyclonal antibodies, antibody containing antisera is isolated from the immunized animal and is screened for the presence of antibodies with the desired specificity using one of the above-described procedures.

In another embodiment, the above-described antibodies are detectably labeled. Antibodies can be detectably labeled through the use of radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for accomplishing such labeling are well-known in the art, for example, see (Sternberger et al., J. Histochem. Cytochem. 18:315 (1970); Bayer et al., Meth. Enzym. 62:308 (1979); Engval et al., Immunol. 109:129 (1972); Goding, J. Immunol. Meth. 13:215 (1976)). The labeled antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to identify cells or tissues which express a specific polypeptide.

In another embodiment, the above-described antibodies are immobilized on a solid support. Examples of such solid supports include plastics such as polycarbonate, complex carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide and latex beads. Techniques for coupling antibodies to such solid supports are well known in the art (Weir et al., “Handbook of Experimental Immunology” 4th Ed., Blackwell Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby et al., Meth. Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies can be used for in vitro, in vivo, and in situ assays as well as in immunochromatography.

Furthermore, one skilled in the art can readily adapt currently available procedures, as well as the techniques, methods and kits disclosed above with regard to antibodies, to generate peptides capable of binding to a specific polypeptide sequence in order to generate rationally designed antipeptide peptides, for example see Hurby et al., “Application of Synthetic Peptides Antisense Peptides”, In Synthetic Peptides, A User's Guide, W. H. Freeman, NY, pp. 289-307 (1992), and Kaspczak et al., Biochemistry 28:9230-8 (1989).

Anti-peptide peptides can be generated in one of two fashions. First, the anti-peptide peptides can be generated by replacing the basic amino acid residues found in the EE polypeptide sequence with acidic residues, while maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine, and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid residues are replaced by lysine, arginine or histidine.

Diagnostic Methods

Also contemplated herein are diagnostic methods that use the EE polypeptides, EE polynucleotides, EE fusion proteins and EE antibodies described herein.

Samples used in the diagnostic methods are samples obtained from a subject that is suspected of having, or having has, E. ewingii infection. Subjects not infected with EE do not have EE DNA, mRNA, protein, or antibody.

The subject may be a human or any animal that can be infected with E. ewingii. Such subjects include, but should not be limited to, humans, horses, deer, cattle, pigs, sheep, dogs, cats and chicken.

The test sample may be a biological fluid such as serum, plasma, whole blood, urine, or saliva, or may be tissue, cells, protein or membrane or nucleic acid extracts of cells, obtained from a subject. The sample used in the methods will vary based on the assay format, the detection method and the nature of the tissues, cells or extracts to be assayed.

For example, in some embodiments, it is advantageous to use more than one EE polypeptide as antigens in diagnostic methods. Since currently no E. ewingii-specific serodiagnosis is available, for serodiagnosis of E. ewingii infection in either humans and animals, use of a combination of EEOMP-1s (i.e. EE OMP-1 proteins or functional derivatives thereof) as the antigen can provide sensitive and specific serodiagnosis. Use of multiple EE polypeptides can provide more sensitive diagnosis than the use of a single EE OMP-1 antigen. Not all humans and dogs develop antibodies to every EEOMP-1 protein. Therefore, the use of a combination of EE polypeptides (e.g. a combination of EE polypeptides corresponding to all OMP-1 proteins) as antigens provides a more comprehensive coverage of antibody responses. Furthermore, the entire EEOMP-1 amino acid sequences disclosed herein can help optimize peptide antigens to provide desired specificity and sensitivity to detect potentially diverse E. ewingii strains in the field.

Diagnostic Methods Using EE Polypeptides and Antibodies

The present invention also provides a method for detecting the presence of antibodies specific to E. ewingii in a test sample. The method includes contacting a test sample suspected of comprising antibodies specific for E. ewingii with one or more E. ewingii polypeptides, as described herein, under conditions that allow polypeptide/antibody complexes to form; and assaying for the formation of a complex between antibodies in the test sample and the one or EE polypeptides. Accordingly, detecting the formation of such a complex is an indication that antibodies specific for E. ewingii are present in the test sample.

Another aspect provides for a method for detecting the presence of E. ewingii polypeptides in a test sample. The method includes contacting a test sample suspected of comprising E. ewingii polypeptides with one or more E. ewingii antibodies that specifically bind to at least one epitope of an E. ewingii OMP protein or peptide, as described herein, under conditions that allow polypeptide/antibody complexes to form; and assaying for the formation of a complex between polypeptides in the test sample and the one or EE antibodies. Accordingly, detecting the formation of such a complex is an indication that E. ewingii polypeptides are present in the test sample.

The presence of EE polypeptides, or antibodies to EE polypeptides, may indicate exposure to E. ewingii, the potential need for therapy of an affected subject, or EE contamination of a biological sample.

For ease of detection, the isolated EE polypeptide or antibody can be attached to a substrate such as a column, plastic dish, matrix, or membrane, such as nitrocellulose. The test sample may be untreated, subjected to precipitation, fractionation, separation, or purification before combining with the isolated polypeptide. Conditions for incubating an EE polypeptide or antibody with a test sample vary. Incubation conditions depend on the format employed in the assay, the detection methods employed, and the type and nature of the antibody used in the assay.

Interactions between antibodies and polypeptide can be detected in a number of ways well know to those skilled in the art. These include, but are not limited to, radiometric, calorimetric, or fluorometric means, size-separation, or precipitation. These assays include, but are not limited to, a microtiter plate assay, a reversible flow chromatographic binding assay, an enzyme linked immunosorbent assay, a radioimmunoassay, a hemagglutination assay, a western blot assay, a fluorescence polarization immunoassay, an indirect immunofluorescence assay, diffusion based Ouchterlony, or rocket immunofluorescent assays. Examples of such assays can be found in Chard, An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock et al., Techniques in Immunocytochemistry, Academic Press, Orlando, Fla. Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, Practice and Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). In one example, detection of the antibody-polypeptide complex is by addition of a secondary antibody that is coupled to a detectable tag, such as for example, an enzyme, fluorophore, or chromophore. Formation of the complex is indicative of the presence of E. ewingii proteins, or anti-E. ewingii antibodies in the subject from whom the test sample was obtained. Thus, the method is used to determine whether a subject is infected with E. ewingii.

In some embodiments, the method employs an enzyme-linked immunosorbent assay (ELISA) or a Western immunoblot procedure. Such methods are relatively simple to perform and do not require special equipment as long as membrane strips are coated with a high quality antigen. Accordingly, it is possible to use a recombinant form of the EE polypeptides since such proteins and peptides, typically, are more pure and consistent in quality than a their purified form.

Diagnostic Methods Using EE Primers and Probes

The probes and primers described herein can be designed and used diagnostically for determining whether a subject has been infected with an E. Ewingii species. Therefore, also provided are methods of detecting the presence of EE polynucleotides in a sample.

Analysis of nucleic acid specific to EE can be by PCR techniques or hybridization techniques (see, for example, Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989; Eremeeva et al., J. Clin. Microbiol. 32:803-810 (1994) which describes differentiation among spotted fever group Rickettsiae species by analysis of restriction fragment length polymorphism of PCR-amplified DNA). For example, methods of using nucleic acid probes to analyze EE genomic DNA via PCR analysis have been described in Chen et al., J. Clin. Microbiol. 32:589-595 (1994).

In one embodiment, the method includes: a) contacting the sample with the above-described nucleic acid probe, under specific hybridization conditions such that hybridization occurs, and b) detecting the presence of the probe bound to the nucleic acid molecule, wherein detecting the presence of such binding is indicative of the presence of E. ewingii in the sample, and therefore, the subject.

The screening and diagnostic methods of the invention that employ probes do not require that the entire EE protein coding sequence be used for the probe. Rather, it is only necessary to use a fragment or length of nucleic acid that is sufficient to detect the presence of the EE nucleic acid in a DNA preparation from a subject.

Alternatively, in another embodiment, the method of detecting the presence of EE nucleic acid in a sample may include: a) amplifying the nucleic acid in the sample with one or more of the above-described primer sets specific for one or more portions of the EE omp-1 gene cluster, SEQ ID NO: 1 using PCR techniques and b) detecting the presence of the amplified nucleic acid molecules, wherein the presence of a PCR product having a sequence or length which corresponds to the sequence or length of the portion of the EE omp-1 gene which is located between the primer set is indicative of the presence of E. ewingii in the sample.

The resulting PCR amplification products can be separated by size by any method, such as gel electrophoresis, and detection of an appropriately sized product indicates E. ewingii infection. One skilled in the art would select the nucleic acid probe according to techniques known in the art as described above.

Methods for preparing nucleic acid extracts of cells are well known in the art and can be readily adapted in order to obtain a sample which is compatible with the method utilized.

Kits

In another embodiment of the present invention, a kit is provided which contains all the necessary reagents to carry out the previously described methods of detection.

The kit can comprise one or more isolated EE polypeptides. For ease of detection, the polypeptides may be attached to a substrate such as a column, plastic dish, matrix, or membrane, such as nitrocellulose. The kit may further comprise a conjugate comprising a binding partner of the polypeptide. The binding partner can be a biomolecule, such as a secondary antibody, for detecting interactions between the isolated polypeptide and antibodies immuno-specific to E. ewingii, in a test sample. In some embodiments, the biomolecule is coupled to a detectable tag such as an enzyme, chromophore, fluorophore, or radio-isotope. The kit can be used by contacting a test sample with the EE polypeptide under conditions that permit formation of antigen-antibody complexes. Then the biomolecule is added and the presence or absence of any resulting antigen-antibody complexes is detected by assaying for a change in the sample, for example, by observing the formation of a precipitate in the sample, the presence of radioactivity on the substrate, or a color change in the sample or on the substrate. Detecting such a change is indicative that the test sample contains anti-E. ewingii antibodies.

In other embodiments the kit can comprise one or more of an above-described antibodies. The kit can further comprise a conjugate comprising a binding partner of the antibody. The binding partner can be a biomolecule, such as a secondary antibody, for detecting interactions between the antibodies and the EE OMP protein or peptide in the test sample. In some embodiments, the biomolecule is coupled to a detectable tag such as an enzyme, chromophore, fluorophore, or radio-isotope. The kit can be used by contacting a test sample with the EE antibody under conditions that permit formation of antigen-antibody complexes. Then the biomolecule is added and the presence or absence of any resulting antigen-antibody complexes is detected by assaying for a change in the sample, for example, by observing the formation of a precipitate in the sample, the presence of radioactivity on the substrate, or a color change in the sample or on the substrate. Detecting such a change is indicative that the test sample contains E. ewingii, or components of E. ewingii.

In other embodiments, the above described kits may further comprises one or more other reagents such as: wash reagents and reagents capable of detecting the presence of bound antibodies. Examples of detection reagents include, but are not limited to, labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody.

Also provided are kits for detecting the presence of EE nucleic acid in a sample, which include at least one of the above-described omp-1 specific nucleic acid probes or primers. In one embodiment, the kit further include: reagents for DNA extraction from the test sample, reagents for PCR amplification, wash reagents and reagents capable of detecting the presence of bound nucleic acid probe. Examples of detection reagents include, but are not limited to radiolabelled probes, enzymatic labeled probes (horseradish peroxidase, alkaline phosphatase), and affinity labeled probes (biotin, avidin, or steptavidin).

In detail, the kit may be a compartmentalized kit in which reagents are contained in separate containers. Such containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allow the efficient transfer of reagents from one compartment to another compartment such that the samples and reagents are not cross-contaminated and the agents or solutions of each container can be added in a quantitative fashion from one compartment to another. Such containers will include a container which will accept the test sample, a container which contains the probe or primers used in the assay, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, and the like), and containers which contain the reagents used to detect the hybridized probe, bound antibody, amplified product, or the like.

One skilled in the art will readily recognize that the nucleic acid probes described in the present invention can readily be incorporated into one of the established kit formats which are well known in the art. One skilled in the art will readily recognize that the EE polypeptides and antibodies described in the present invention can readily be incorporated into one of the established kit formats which are well known in the art.

Immunogenic Compositions and Vaccines

The present invention also relates to immunogenic compositions comprising one or more E. ewingii OMP proteins, or immunogenic fragments and variants thereof, or a fusion protein containing same, collectively referred to herein as an “immunogenic EE polypeptide” and a pharmaceutically acceptable carrier

The immunogenic EE polypeptides, as used herein, comprise an epitope-bearing portion of an EE OMP protein. In some embodiments, the epitope-bearing portion comprises a sequence of at least 6 consecutive amino acids within the variable loops of OMP proteins shown in Tables 2-5. Some examples of immunogenic fragments (or peptides) are shown in Table 6.

An immunogenic EE polypeptide is a polypeptide that is capable of producing antibodies with a specific binding affinity to E. ewingii in a subject to whom the immunogenic composition has been administered.

In another embodiment, the present invention relates to a vaccine comprising an immunogenic EE polypeptide, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the immunogenic EE polypeptide is present in an amount effective to elicit a beneficial immune response in a subject to EE. The immunogenic EE polypeptide may be obtained as described above and using methods well known in the art.

In another embodiment, the present invention relates to a vaccine comprising an EE nucleic acid (e.g., DNA) or a segment thereof (e.g., a segment encoding an immunogenic EE polypeptide) together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the nucleic acid is present in an amount effective to elicit, in a subject, a beneficial immune response to EE. The EE nucleic acid may be obtained as described above and using methods well known in the art.

In a further embodiment, the present invention relates to a method of producing an immune response which recognizes EE in a host, comprising administering to the host one or more of the above-described immunogenic EE polypeptides.

In some embodiments, the host or subject to be protected is selected from the group consisting of humans, horses, deer, cattle, pigs, sheep, dogs, cats and chickens. In some embodiments, the animal is a human or a dog.

In a further embodiment, the present invention relates to a method of preventing or inhibiting ehrlichiosis in a subject comprising administering to the subject the above-described vaccine, wherein the vaccine is administered in an amount effective to prevent or inhibit Ehrlichiosis. The vaccine of the invention is used in an amount effective depending on the route of administration. Although intra-nasal, subcutaneous or intramuscular routes of administration are suitable, the vaccine of the present invention can also be administered by an oral, intraperitoneal or intravenous route. One skilled in the art will appreciate that the amounts to be administered for any particular treatment protocol can be readily determined without undue experimentation. Suitable amounts are within the range of 2 μg of the EE vaccine per kg body weight to 100 micrograms per kg body weight (preferably, 2 μg to 50 μg, 2 μg to 25 μg, 5 μg to 50 μg, or 5 μg to 10 μg).

Examples of vaccine formulations including antigen amounts, route of administration and addition of adjuvants can be found in Kensil, Therapeutic Drug Carrier Systems 13:1-55 (1996), Livingston et al., Vaccine 12:1275 (1994), and Powell et al., AIDS RES, Human Retroviruses 10:5105 (1994). The vaccine of the present invention may be employed in such forms as capsules, liquid solutions, suspensions or elixirs for oral administration, or sterile liquid forms such as solutions or suspensions. Any inert carrier may be used, such as saline, phosphate-buffered saline, or any such carrier in which the vaccine has suitable solubility properties. The vaccines may be in the form of single dose preparations or in multi-dose flasks which can be used for mass vaccination programs. Reference is made to Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., Osol (ed.) (1980); and New Trends and Developments in Vaccines, Voller et al (eds.), University Park Press, Baltimore, Md. (1978), for methods of preparing and using vaccines.

The vaccines of the present invention may further comprise adjuvants which enhance production of antibodies and immune cells. Such adjuvants include, but are not limited to, various oil formulations such as Freund's complete adjuvant (CFA), the dipeptide known as MDP, saponins (ex. QS-21, U.S. Pat. No. 5,047,540), aluminum hydroxide, or lymphatic cytokines. Freund's adjuvant is an emulsion of mineral oil and water which is mixed with the immunogenic substance. Although Freund's adjuvant is powerful, it is usually not administered to humans. Instead, the adjuvant alum (aluminum hydroxide) may be used for administration to a human. Vaccine may be absorbed onto the aluminum hydroxide from which it is slowly released after injection. The vaccine may also be encapsulated within liposomes according to Fullerton, U.S. Pat. No. 4,235,877.

The present invention will be better understood by reference to the following examples which are offered by way of illustration, not limitation.

EXAMPLE Identification of 19 Polymorphic Major Outer Membrane Protein Genes and Their Immunogenic Peptides in Ehrlichia ewingii

Since ehrlichial infections induce significant antibody titers in non-immunocompromised patients, and nonexposed people seldom have antibodies reactive to Ehrlichia spp., serologic tests are considered reliable tests for confirmation of ehrlichioses, especially when ruling out the possibility of infection. In order to develop a serologic test using major antigens of E. ewingii, genes encoding these proteins must be first identified.

There are a number of challenges to sequencing genes encoding E. ewingii outer membrane proteins. First, E. ewingii DNA amount available from naturally or experimentally infected dogs is limited. Second, E. ewingii DNA concentration in the total DNA extracted from the blood is very low, due to a small amount of bacteria present in the blood and is difficult to enrich due to obligatory intracellular nature of this bacterium. Third, DNA sequences encoding OMP-1/P28/P30/MAP1, are too divergent to design universal primers. Prior to the instant application, only a partial sequence (505 bp) of a single member OMP-1 family p28-19 has been cloned in E. ewingi, and the sequence of other E. ewingii outer membrane proteins, or the genes encoding such proteins, remained unknown.

The purposes of the reported study were to i) determine the E. ewingii omp-1 gene family, ii) determine each OMP-1-specific peptide, and iii) analyze all OMP-1 synthesized peptides for antigenicity.

We systematically identified the entire E. ewingii OMP-1 genomic locus. Using nested touchdown PCR and a primer walking strategy, we found 19 omp-1 paralogs in E. ewingii. These genes are arranged in tandem downstream of tr1 and upstream of secA in a 24-kb genomic region. Predicted molecular masses of the 19 mature E. ewingii OMP-1s range from 25.1 to 31.3 kDa with isoelectric points of 5.03 to 9.80.

These multigene family proteins are composed of conserved and unique amino acid sequences. This led to our idea that, rather than the whole OMP-1 protein, antigenic OMP-1 peptides unique to E. ewingii can provide better serologic diagnostic antigens. Therefore, differences of the genomic loci and sequences of E. ewingii omp-1s with those of E. chaffeensis omp-1/p28, E. canis p30s and E. ruminantium map 1 were determined to design antigenic OMP-1 peptides specific to E. ewingii.

Based on comparative sequence analyses among OMP-1s from E. ewingii and the three other Ehrlichia spp. (FIG. 8), each E. ewingii OMP-1 oligopeptide predicted to be antigenic, bacterial surface exposed, unique in comparison to the other E. ewingii OMP-1s, and distinct from other Ehrlichia spp. was synthesized to perform ELISA. Plasma from E. ewingii-experimentally infected dogs significantly reacted with most of the OMP-1-specific peptides, indicating that multiple OMP-1 proteins were expressed and immunogenic in infected dogs. The results support the utility of the tailored OMP-1 peptides as E. ewingii serologic test antigens.

Materials and Methods

E. ewingii omp-1 Cluster Amplification, Sequencing, and Assembly.

An EDTA-treated whole-blood specimen (−200 μl) collected in April 2005 from an 8-week-old male German Shepherd mixed breed dog in Ohio was used for DNA extraction. DNA was extracted using the QIAamp blood kit (QIAGEN, Valencia, Calif.) and used as the template for the entire amplification and sequencing process. E. ewingii infection of the dog was confirmed by PCR and sequencing of the 16S rRNA of E. ewingii as well as observation of bacterial inclusions (morulae) in granulocytes in the blood and joint fluid smear. PCR analysis showed that the dog was negative for infection by A. phagocytophilum, E. chaffeensis and E. canis (Qingming Xiong, Weichao Bao, and Yasuko Rikihisa, unpublished data).

The omp-1 fragments were amplified using first touchdown PCR (Roux, K. H. et al. (1997) Methods Mol. Biol. 67:39-45) with the primer pairs F1 and R7, F8 and R14, and F15 and R21 (Table 7). The PCR reaction (50 μl) included 0.5 μl template DNA corresponding to 4 μl of the original blood sample, 10 pmol of each primer, 0.2 mM deoxynucleoside triphosphate mixture, 2.5 U high-fidelity Taq polymerase (Invitrogen, Carlsbad, Calif.), and 1.5 mM MgCl2. Amplification was performed with a program (94° C. for 3 min; a gradient over 10 cycles of 94° C. for 0.5 min, 64° C. for 0.5 min, and 72° C. for 2 min, the annealing temperature decreased by 1° C./cycle; 35 cycles of 94° C. for 0.5 min, 55° C. for 0.5 min, and 68° C. for 9 min; and finally 68° C. for 9 min). The nested PCRs were performed using the first PCR products as template with 21 pairs of degenerate primers, with amplicons of approximately 1,500 bp that each overlapped approximately 200 bp according to E. chaffeensis and E. canis omp-1 clusters (Table 7, FIG. 1). Conditions of the nested PCR were similar to the first PCR except that Taq polymerase was used and the elongation step was at 72° C. for 2 min. The nested PCR products were run on a 1% agarose gel with TAE buffer (40 mM Tris-acetate, 1 mM EDTA, pH 8.0). The amplified DNA fragments were recovered from the gel with the QIAEX II Gel Extraction kit (QIAGEN) and directly sequenced with the nested PCR primers (FIG. 1). For fragment 3, two touchdown PCRs with high-fidelity Taq polymerase were performed using the infected dog blood DNA as template and one of the following primer pairs: forward primer P28-19F and primer R21, or the forward primer designed based on the 3′ end of fragment 2 (Specific 4F) and reverse primer P28-19R (Table 7). Nested amplification of these two PCR products and direct sequencing were used for subsequent design of new specific primers. Direct sequences obtained ranged from 250 to 800 bp. The poly G/C or A/T regions (FIG. 1) were cloned using the TA cloning kit (Invitrogen), and the plasmid inserts were sequenced. All sequencing data were assembled using the SeqMan program of DNASTAR software (DNASTAR Inc., Madison, Wis.).

TABLE 7 Primers used. SEQ ID SEQ ID NO: Degenerate primers NO: E. ewingii specific primers 44 F1: 86 Specific 1F: CGYATYATGAGAGGTATGAG GTACTTTGCCATTCCCAGAGA 45 R1: 87 Specific 1R: AGGRTCTATATGTTTTGGTGCT GATCTACTCCAAACCCAAGAC 46 F2: 88 Specific 1RA: TTGYATTGGTATAGGGCAAGGA GGAATTACTGCTCCAATAGTAGC 47 R2: 89 Specific 2F: CTCAAATTTTTTACCRAATAAACCATG GTTGATGGGTATTACCACAGAG 48 F3: 90 Specific 2R: CRTATTCATGTTTAGGRTTTGG CACCTAGTATTTTGCTGAAGCT 49 R3: 91 Specific 3F: AGTTGCTAWAGCAAARTACTC TTACTTACCCACTATCTGGTAAC 50 F4: 92 Specific 3R: TAGAASTTGAAGCTTTTTATGAG TAATTTCCCCTGACCTGCAAAC 51 R4: 93 Specific 0F: GATATACCRTTRTTTTTTGCTACAG CAAACCAGTTTATTGACTGGGCAT 52 F5: 94 P28-19F: AARTWCTTTGCTATACCACGTA CAATCATGCTAAATGCATGTTATGAC 53 R5: 95 P28-19R: TCTATTTCTAAYCTTGGYCCTTG GGATTTATGCTATTAAACATTGACAC 54 F6: 96 Specific 0FA: ATRGGYCTTRCAAMTGATGTTAC TTCAAGCTAAGCTAGGTTTAGG 55 R6: 97 Specific 1RB: YTTAYTCCARCTTCACCACCA CATATTAACTCAATCAAGTAAACACAC 56 F7: 98 Specific 1FA: GCARTAGCWACACTTAATGTTG CCTCTTACCTCAAATTTAGTTCTC 57 R7: 99 Specific 2RA: CCTGGTTTATATTGMCCACTT TTCACCTATACCTAAGCATACATAAG 58 F8: 100 Specific 3FA: GAGTATTTYGGTRGTGAATTTGG GTCATGCTATATAGATGATACTGTG 59 R8: 101 Specific 4F: RAAATCTCCTCCTAKTCCTGC TCCCTTATGTTTTTGTATTCCTATAC 60 F9: 102 Specific 4R: CTGTMATGAGAAAYGACGGGTT CCATCCATAGCATAACCGATAC 61 R9: 103 Specific 2FA: TAYYAATKTCAACAGAATCAAYATC CTGTTATGAGAAATGACGGAGTTTC 62 F10: 104 Specific 3FB: CAATAYAAACCCAGTGTTTCTG CGTACATAGAGTGTTATAGGCAATTC 63 R10: 105 Specific 0FB: GRATAAGTAAYACCTAAYTTACC GGTTTAAGTATATGAGTTATAAGAAGGT 64 F11: 106 Specific 1FB: TAYRGTMAATGGCTGCTATGAT ATGCACAGGCATTGGTGGAGA 65 R11: 107 Specific 2RB: AAGTGTAGCWACTGCRGATGT GTATATATGCATATGTAACATGCAAG 66 F12: 108 P28-19RA: TACCATMAAGTAATRGGCAATCA GGCATGTACTTTCCGCTGATG 67 R12: 109 Specific 3RA: AYTTCTCCGCCAAAGTATCCA CTTTACTACTTTCTGATTCACGT 68 F13: 110 Specific 5F: GCTCCTCAAACCACATCTGC TGCTTTTATTGGTGGGCACTTTC 69 R13: 111 Specific 6R: TAKGGTTTATAGCKTCAAACATG TAAGTTTTTTGCATTATCTCGTGAAG 70 F14: 112 Specific 7F: TTYTCWCCTTACATATGTGCAG TTGCACAAAAAATCTTTGGCTCAG 71 R14: 113 Specific 7R: CARTTCATATTTACACCWGAAAKAGTGAA ATTAACGCATTTGCATGTAGTAGTGTG 72 F15: 114 Specific 4FA: GTWTTTAMWTTGTAKKTTTACTACTGTT CAAGGAAAACTAGGTATAAGTTACTC 73 R15: 115 Specific 4RA: CTAYTCTTGGRCCACCCATTG AAGACTGGTATGGTAAGACTGTC 74 F16: 116 Specific 6F: TAGGGTTTGCAGGAGCTATTG ACACCCCATAACACCACTAAAAG 75 R16: 117 Specific 6RA: AATTTTAGGRYTTRTAGCTTCAAAC GTTTGTTAACTACCCTGTAAAGTC 76 F17: 118 Specific 7RA: TATGYGCAGGTRTTGGTACTGA GATAGTACAAACCTGTAAGATGTTAC 77 R17: 119 Specific 3RB: GAWGCTTCTGGGCTTATRGAGT AACCTAAATTGCCTATCGATATCATC 78 F18: 120 Specific 7RB: CAAATCCTAAAATTTCTTAYCAAGGA TCAACCGTAATATTTAGTGTAGCATC 79 R18: 121 Specific 6FB: TYAGTAATTTTTCAGCTGAAGAAAC CAATATGGCTTTTAGTATCTTGTACATC 80 F19: 122 Specific 7RD: GCAAAAYTGCTTGCATAWGTAG TACACTACTTATTGGTATTGTTGGTAG 81 R19: 123 Specific 6RA2: ATTTYTCAGAAGARTATGTTCCA TATGTTGTTTGGAGGTGGTTACTATC 82 F20: 124 Specific 6FA: GAGTMAAAAAYTTTAAYAATRTCTTCTC CCTATATCTAAGTTAGCTAATGCCGAAG 83 R20: 125 Specific 7RC: AAAATATCCATTRTAGCTTACCT TTTTTGTTTTTCTGTTTTGTGTAACCTGTG 84 F21: 126 Specific 2FB: ATGWTAAATTYATGYTTAAGTTGCA CTGGGCATTCTTCAATAGATGCTC 85 R21: SCCYGTYTTCATTTCGGATATC

E. ewingii omp-1 Cluster Analysis.

Artemis (38) was used to identify the ORFs in the newly obtained E. ewingii omp-1 cluster. The ORFs longer than 100 amino acids were blasted against the NCBI GenBank database to find homologs. The Artemis comparison tool (Carver, T. J., et al. (2005) N Engl J Med 341:148-155) was used to analyze the synteny of the E. ewingii omp-1 cluster to that of E. chaffeensis, E. canis, and E. ruminantium. To search for repeat regions in the E. ewingii omp-1 cluster and between the E. ewingii omp-1 cluster and E. chaffeensis, E. canis, or E. ruminantium's omp-1 clusters, dot plot analysis was performed with Java Dot Plot Alignments program (JDotter) athena.bioc.uvie.ca/index.php.

Phylogenetic Analysis.

The deduced amino acid sequences of E. chaffeensis OMP1/P28s, E. canis P30s, E. ruminantium MAP1s, and E. ewingii OMP-1s were aligned using the MegAlign program of DNASTAR software. Then, phylogenetic analysis was performed with PHYLIP software (version 3.66) (Felsenstein, J. (1989) Cladistics 5:164-166). The phylogram was constructed using the Neighbor-Joining method with Kimura's formula and 1,000 bootstrap replications were conducted to evaluate the reliability of the tree (Felsenstein, J. (1989) Cladistics 5:164-166).

Peptide Synthesis and Peptide-Pin ELISA Analysis.

The peptide libraries were synthesized using non-cleavable multipin synthesis technology and fluorenylmethoxycarbonyl chemistry (Mimotopes Pty. Ltd., Victoria, Australia) (Geysen, H. M. (1990) Southeast Asian J Trop Med Public Health 12:523-533). After disruption of the peptide-pins with 0.1 M sodium phosphate buffer containing 1% SDS (pH 7.2) and 0.1% β-mercaptoethanol and hot (temperature) water washes, nonspecific binding sites were blocked with 200 μl of 3% skim milk (Becton, Dickinson and Co., Sparks, Md.) in PBS/Tween-20. Blocking was carried out in 96-well plates for 1 h at room temperature. Sets of peptide-bound pins were washed once with PBS containing 0.1% (v/v) Tween-20 for 10 min and then incubated in the blocking solution (1:100 dilution) with plasma from E. ewingii- or E. chaffeensis-infected dogs, or pre-infection dog plasma, at 4° C. overnight. Samples from dogs 2119, 2185, and 2405 were collected at days 206, 109, and 110 post-infection, respectively, with E. ewingii. Samples from dog CTUALJ (E. chaffeensis IFA titer, 1:2,560), dog 1425 (E. chaffeensis IFA titer, 1:320), and dog 3918815 (E. chaffeensis IFA titer, 1:2,560) were collected at days 41, 121, and approximately 210 post-infection, respectively, with E. chaffeensis. Dogs 2119, 2185, and 1425, and preinfection plasma from dog CTUALJ, were used as negative controls. After washing four times as described above, the peptide pins were placed in wells filled with horseradish peroxidase-labeled goat anti-dog IgG (H+L) (Kirkegaard & Perry Laboratories, Gaithersburg, Md.) diluted at 1:1,000 in PBS/Tween-20 and incubated for 1 h at room temperature. Samples were washed four times, and then the peptide pins were incubated for 20 min at room temperature with horseradish peroxidase substrate 2,2′-azido-di-(3-ethyl)-benzthiazoline-6-sulfonic acid (Sigma, St. Louis, Mo.) in 70 mM citrate buffer (pH 4.2) applied to a new plate. Absorbance at 405 and 492 nm was measured in an ELISA plate reader (Molecular Devices, Sunnyvale, Calif.). Each assay was repeated at least three times. The cut-off OD_(405nm)-OD_(492nm) value for positive reaction was set as the mean OD_(405nm)-OD_(492nm)+three standard deviations of the negative control plasma.

Results

E. ewingii omp-1 Cluster Sequencing and Assembly.

Forty-two degenerate primers were initially designed based on the conserved regions of the aligned omp-1/p30 clusters of E. chaffeensis and E. canis (Table 7, FIG. 1). To efficiently utilize the limited amount of E. ewingii DNA, the putative omp-1 cluster was divided into three overlapping fragments of approximately 9 kb, estimated based on homologous regions of E. chaffeensis and E. canis. The first touchdown PCR (Roux, K. H. et al. (1997) Methods Mol. Biol. 67:39-45) was designed to amplify the three putative long fragments. The PCR products were then used as templates for the 21 nested touchdown PCRs using degenerate primer pairs (Table 7, FIG. 1). As a result only four PCRs showed bands ranging from ˜200 to ˜1,500 bp: F1 and R1 (˜700 bp), F4 and R4 (˜1,000 bp), F7 and R7 (˜1,500 bp), and F11 and R11 (˜220 bp). The PCR products were directly sequenced. The result showed they belong to the omp-1/p28/p30 family. For regions covered by fragments 1 and 2 (FIG. 1), E. ewingii-specific omp-1 primers were designed based on the four newly obtained E. ewingii omp-1 DNA sequences (Table 7). However, because no omp-1 was amplified in fragment 3 using degenerate primers, two touchdown PCRs with high-fidelity Taq polymerase were performed using the infected dog blood DNA as template and one of the following primer pairs: forward primer P28-19F designed based on the conserved region of E. ewingii p28-19 DNA sequences (Gusa, A. A., et al. (2001) J Clin Microbiol 39:3871-3876) and primer R21, or the forward primer designed based on the 3′ end of fragment 2 and reverse primer P28-19R designed based on the conserved region of the p28-19 DNA sequence. Nested amplification of these two PCR products and direct sequencing were used for subsequent design of new specific primers. This process was repeated for three fragments until we encountered the poly G/C or A/T regions in fragments 2 and 3. The poly A/T and poly C/G tracts (FIG. 1) were determined by TA cloning and sequencing 10 and 22 plasmid inserts, respectively, in each of two regions. The poly G tract had 9-13 Gs (the number of Gs was distributed among the 22 sequenced clones as follows: 9G=1, 10G=4, 11G=7, 12G=2, and 13G=8), and was reported as 13G according to SeqMan software. The poly A tract had 10-13 As (the number of As was distributed among the 10 sequenced clones as follows: 10A=1, 11A=3, 12A=3, and 13A=3), and was reported as 12A according to SeqMan software. The predominant in-frame sequences in each region were deposited in GenBank. The final sequence assembled from the entire E. ewingii omp-1 locus contained 24,126 bp (GenBank accession No. EF116932). The G+C content of the E. ewingii omp-1 cluster was 28.74%, which is similar to that of E. canis, E. chaffeensis, and E. ruminantium (29.36%, 30.95%, and 27.19%, respectively). E. ruminantium is the causative agent of heartwater in ruminants in Africa and Caribbean countries. Sequence identity of the entire E. ewingii omp-1 cluster relative to E. canis, E. chaffeensis, and E. ruminantium was 28.4%, 22.2%, and 14.8%, respectively.

Features of the OMP Cluster Structure are Conserved Among the Ehrlichia Species.

The Artemis software analysis showed that each of the 24 ORFs encode more than 100 amino acids in the assembled E. ewingii omp-1 DNA fragment. One of the 24 ORFs in the middle of the cluster was short (390 bp), partially overlapped with two other ORFs in the opposite orientation, and had no homolog in the GenBank database, and thus this ORF was not included in the Figures or Table 1. The 23 ORFs were numbered ORF 1 to 23. These 23 genes were arranged in tandem except for three ORFs (ORF19, 20, and 21) that were in the opposite orientation. Nineteen of these 23 ORFs encoded proteins homologous to OMP-11P28/MAP1 of E. chaffeensis, E. canis, or E. ruminantium. Most closely related proteins to each EEOMP-1 are listed in Table 8.

We numbered them E. ewingii (EE)OMP-1-1 to EEOMP-1-19 (FIG. 2). The sequence similarity and molecular mass of EEOMP-1-8 was less than that of the other EEOMP-1s. There is a protein ortholog of EEOMP-1-8, UN3, in the E. chaffeensis and E. canis genomes with unknown function. In E. ruminantium, the EEOMP-1-8 ortholog is MAP1-9. The protein encoded by the first ORF (ORF1) is homologous to a hypothetical transcriptional regulator, and the protein encoded by the last ORF (ORF23) is homologous to SecA. Proteins encoded by the other two ORFs (ORF4 and ORF22) are most homologous to two E. chaffeensis and E. canis peptides, UN2 and UN4, with unknown function, as well as two E. ruminantium peptides, UN1 and UN2, whose function is unknown. The p28-19505 bp sequence was a part of EEomp-1-16 (Table 1). Intergenic spaces between omp-1 genes ranged from 6 to 1,343 bp (Table 1). At the 5′ half of each OMP cluster, 14 genes (un2 to EEomp-1-13 in E. ewingii) were linked by short intergenic spaces ranging from 6 to 26 bp (Table 1). Eight genes in the 3′ half (EEomp-1-14 to EEomp-1-19) were connected by longer intergenic spaces ranging from 301 to 808 bp. Thus, features of the OMP cluster structure were conserved among E. ewingii, E. canis, E. chaffeensis, and E. ruminantium, with the exception of the opposite orientation of three genes, instead of one gene (E. canis and E. ruminantium) or two genes (E. chaffeensis) at the 3′ end.

After removal of the signal peptide sequence, predicted molecular masses of mature E. ewingii OMP-is ranged from 25.1 to 31.3 kDa. The predicted signal peptides ranged from 21 to 32 amino acids. The predicted isoelectric points of the mature OMP-is were 5.03 to 9.80. Properties of the ORFs of the E. ewingii omp-1 cluster, including predicted signal peptide lengths, molecular masses of mature proteins and isoelectric points, are shown in Table 1.

Omp-1/p28/mapl Gene Clusters Display Synteny at the 5′ End.

The synteny among entire OMP-1 gene clusters of E. ewingii and three related Ehrlichia species was analyzed by Artemis Comparison Tool, and the results are shown in FIG. 3. The genes at the 5′ end of the omp-1 clusters were more highly conserved than genes in the central region or 3′ end (FIG. 3). Previously we defined three repeat sequence regions, α, β and γ, in omp-1 clusters of E. chaffeensis and E. canis (Ohashi, N., et al. (2001) Infect Immun 69:2083-2091). The dot plot analysis of the E. ewingii omp-1 cluster and the dot plot between E. ewingii and E. ruminantium only revealed β and γ repeat regions (FIG. 4). The β repeat region in E. ruminantium was shorter than that of E. chaffeensis and E. canis. The dot plot analysis between E. ewingii and E. chaffeensis and between E. ewingii and E. canis showed three clear repeat regions, indicating that the a region is expanded in E. canis and E. chaffeensis.

The phylogenetic analysis of all 79 OMPs of E. ewingii, E. chaffeensis, E. canis, and E. ruminantium is shown in FIG. 5. The previously defined a and β1 regions in the E. chaffeensis omp-1 cluster (Ohashi, N., et al. (2001) Infect Immun 69:2083-2091) encoded five (P28, OMP-1F, -1D, -1C, and -1E) and four (OMP-1H, -1A, -1S, and -1Z) proteins, respectively, and α and β1 regions in the E. canis p30 cluster encoded six (P30, P30-1, P30-2, P30-3, P30-4, and P30a) and four (P30-6, P30-5, P30-7, and P30-8) proteins, respectively. However, in E. ewingii, the α and β1 regions each encoded two proteins (EEOMP-1-15, EEOMP-1-16 and EEOMP-1-12, EEOMP-1-13, respectively). In E. ruminantium, the a region encoded only one protein (MAP1) and the β1 region encoded two proteins (MAP1-2, MAP1-3) (FIG. 5).

EEOMP-1-8 and MAP1-5 were far removed from the remaining OMP-1s, raising the possibility that they do not belong to the OMP cluster (FIG. 5). EEOMP-1-18 and EEOMP-1-19, which are encoded by genes in the reverse orientation, were clustered with P28-2, which is encoded by one of two reverse-oriented E. chaffeensis omp-1 genes. All proteins except α and β1 group proteins formed separate small clusters, including four proteins from each of the four Ehrlichia species. Each cluster of proteins is thus expected to share a common ancestor.

Previously reported 505-bp E. ewingii p28-1 sequences (GenBank accession numbers: AF287961, AF287962, AF287963, AF287964, AF287966) (Gusa, A. A., et al. (2001) J Clin Microbiol 39:3871-3876) were compared with corresponding sequences identified in the present study. The 505 bp begins at 16,918 bp and ends at 17,422 bp in the cluster, which corresponds to 75% of omp-1-16 (i.e., from 133 to 637 bp of the 849-bp omp-1-16 gene). The E. ewingii p28-1 sequences of a Missouri canine sample and an Oklahoma human sample (Gusa, A. A., et al. (2001) J Clin Microbiol 39:3871-3876) were identical to the sequence obtained from the Ohio dog analyzed in the present study.

TABLE 8 Comparison of the most closely related E. chaffeensis and E. canis OMPs with E. ewingii OMPs E. ewingii OMP-1 Most closely related % identity of paralogs Ehrlichia orthologs orthologs OMP-1-1 OMP-1M 66.2 OMP-1-2 OMP-1N 51.5 OMP-13 OMP-1Q 45.1 OMP-1-4 OMP-1P 51.6 OMP-1-5 OMP-1T 48.5 OMP-1-6 P30-14 60.9 OMP-1-7 OMP-1V 67.4 OMP-1-8 MAP1-8 18.9 OMP-1-9 P30-12 51.6 OMP-1-10 P30-11 59.5 OMP-1-11 OMP-1Y 55.8 OMP-1-12 P30-5 63.5 OMP-1-13 OMP-1H 59.4 OMP-1-14 OMP-1B 71.0 OMP-1-15 OMP-1E 60.8 OMP-1-16 OMP-1F 64.3 OMP-1-17 P28-1 69.4 OMP-1-18 P28-2 49.3 OMP-1-19 P28-2 47.1

E. ewingii OMP-1-Specific Peptide ELISA.

Following our OMP-1 amino acid sequence alignment, repetitive sequence analysis, and phylogenic analysis results, we designed E. ewingii OMP-1-specific peptides for serologic tests. As OMP-1s share repetitive common or homologous amino acid sequences with OMP-1 s of the same or different Ehrlichia sp., it is difficult to design recombinant proteins (>10 kDa) that provide Ehrlichia sp. specific or gene-specific antigens. Also, to clone, express, and purify 19 recombinant OMP-1 proteins are cost and labor-prohibitive. Therefore, we designed 12-17-mer peptides specific to each of the 19 E. ewingii OMP-is. For this purpose, extracellular loops of the 19 E. ewingii OMP-1s were first predicted using the Posterior Decoding method of PRED-TMBB bioinformatics.biol.uoa.gr/PRED-TMBB) (Bagos, P. G., et al. (2004) Nucleic Acids Res 32:W400-404). PRED-TMBB is a web server capable of predicting transmembrane strands and topology of β-barrel in OMPs of Gram-negative bacteria based on a Hidden Markov Model. The validity of these predictions is tested using non-homologous OMPs with structures known at atomic resolution according to the Conditional Maximum Likelihood criteria (Bagos, P. G., et al. (2004) Nucleic Acids Res 32:W400-404). Relatively highly antigenic and hydrophilic 12-17-mer peptide fragments located within one of the extracellular loops were chosen from each of the 19 EEOMP-1 amino acid sequences based on DNASTAR Protean analysis. Using the program BLAST, these peptide sequences were compared with the entire E. ewingii omp-1 locus and the E. chaffeensis, E. canis and E. ruminantium genome sequences to synthesize one peptide specific to each of the 19 EEOMP-1 (FIG. 8, Table 6).

Plasma from three dogs experimentally infected with E. ewingii and preinfection plasma from four dogs were then tested in ELISA containing the 19 EEOMP-1-specific peptides. Thirteen peptides (EEOMP-1-1, EEOMP-1-2, EEOMP-1-3, EEOMP-1-4, EEOMP-1-5, EEOMP-1-8, EEOMP-1-9, EEOMP-1-10, EEOMP-1-13, EEOMP-1-14, EEOMP-1-15, EEOMP-1-16, and EEOMP-1-19) were consistently recognized with plasma from three dogs experimentally infected with E. ewingii compared with preinfection dog plasma (FIG. 6).

As geographical distributions, vector ticks and animal reservoirs overlap between E. ewingii and E. chaffeensis, it is important to distinguish them by a simple assay. Therefore, we examined the immunological cross reactivity of these peptides with plasma from three dogs experimentally infected with E. chaffeensis. Among 13 EEOMP-1s specifically recognized in E. ewingii-infected dogs, EEOMP-1-8, EEOMP-1-10, and EEOMP-1-15 were recognized by one of three E. chaffeensis-infected dogs (FIG. 6). While more specimens need to be tested, the peptide-pin ELISA result suggests that of the remaining 10 EEOMP-1 peptides, 8 peptides (EEOMP-1-1, EEOMP-1-3, EEOMP-1-4, EEOMP-1-5, EEOMP-1-13, EEOMP-1-14, EEOMP-1-16, and EEOMP-1-19) serve as good candidate antigens for E. ewingii serodiagnosis based on high sensitivity (indicated by the ratio of E. ewingii plasma reactivity/control plasma reactivity) and good specificity (indicated by the ratio of E. chaffeensis plasma reactivity/control plasma reactivity, −1.00).

Discussion

In the present study and for the first time, the entire 24-kb E. ewingii omp-1 locus containing 19 omp-1 genes was sequenced. As the only available source of E. ewingii DNA was a small amount of the infected dog blood specimen, we employed touchdown PCR. This method has been used previously to amplify a small amount of fragmented Aegyptianella pullorum DNA from archival paraffin sections on glass slides. Incorrect base calls resulting from amplification or sequencing errors have been minimized in the present study because large pools of PCR products were directly sequenced. In addition, multiple overlapping regions throughout the sequences ensure the reliability of sequencing results. So far, only a few E. ewingii genes, including 16S rRNA, groESL, p28-19, dsbA (GenBank Accession No. DQ902688), gitA (GenBank Accession No. DQ365879), and disulfide oxidoreductase have been reported. Applying a similar approach as used here, it would be possible to obtain DNA sequences of other genomic regions to further our understanding of this uncultivable emerging zoonotic pathogen.

Because E. ewingii infects granulocytes, the distinction between E. ewingii and a strain of A. phagocytophilum was unclear prior to the molecular era. However, in concordance with the 16S rRNA and groESL sequence-based classification of this bacterium, our finding of the complete OMP-1 cluster structure flanked with tr1 and secA clearly demonstrated that E. ewingii belongs to the genus Ehrlichia. Synteny analysis suggests that the OMP clusters existed in a common ancestor of the present day four Ehrlichia species. Furthermore, the locus appears to have been partially scrambled as species evolved. The E. ewingii OMP-1 cluster has greater synteny with monocytotropic E. chaffeensis and E. cards than with endotheliotropic E. ruminantium. It is possible that OMP-1s and host cell type specificity co-evolved.

The present study revealed 19 E. ewingii OMP-1 amino acid sequences and examples of 19 E. ewingii immunogenic amino acid sequences. Studies on E. chaffeensis have shown an important role for OMP-1/P28 outer membrane proteins in the stimulation of host immune response and protection of the host from infection. Immunization with recombinant P28 (one of the major outer membrane OMP-1/P28 family members) has been shown to protect mice from E. chaffeensis challenge. The monoclonal antibody against OMP-1 g (P28) mediates protection of SCID mice from E. chaffeensis fatal infection. While antibodies against a single OMP-1 protein confer partial protection, existence of multiple homologous surface proteins likely plays a role in the organism's evasion of host immune response. A recent proteomic study showed that 18 out of 21 E. chaffeensis OMP-1/P28 family proteins are indeed bacterial surface-exposed, supporting the idea of immunoevasion. The number of E. ewingii omp-1 genes found in the OMP-1 cluster (19 copies) was similar to that of E. canis (22 copies, but there is an additional locus with duplicates of three p30s) E. chaffeensis (22 copies), and E. ruminantium (16 copies). In addition, there is extensive diversification among omp-1 genes of E. ewingii, similar to other Ehrlichia spp., supporting the hypothesis that multiple omp-1/p28 paralogs present in Ehrlichia sp. are involved in immunoavoidance. Thus, theses studies suggest that incorporation of immunogenic peptides of multiple OMP-1s in the vaccine preparation may provide better protection against Ehrlichia infection than the use of a single OMP-1 in the vaccine.

Multiple OMP-1/P28 and P30 mRNAs are expressed by E. chaffeensis and E. canis during experimental infections of dogs with these bacteria. All 22 E. chaffeensis P28 recombinant antigens are recognized by sera from two dogs experimentally infected with E. chaffeensis. Similarly, the present results suggest all 19 EEOMP-1 peptides were recognized by the plasma from three E. ewingii-infected dogs. Thus, the lack of immunological cross-reactivity of E. canis and E. chaffeensis OMP-1/P28/P30 with plasma from human patients or dogs infected with E. ewingii in the previous studies is likely due to divergence of the amino acid sequences of the E. ewingii OMP-is from those of the E. canis and E. chaffeensis OMP-1/P28/P30 proteins expressed in cell culture. It is also most likely that in E. ewingii infected humans and dogs, different combinations of multiple OMP-1s are expressed at different stages of infection, and under different immune and health status of animals. Therefore, for serodiagnosis of E. ewingii infection in both humans and animals, use of a combination of EEOMP-1 peptides as serodiagnostic antigens is expected to provide more sensitive and more specific serodiagnosis with broader coverage than the use of a single EEOMP-1 antigen. Furthermore, the DNA sequence data also obtained in the present study should help refine diagnostic PCR for human and dog granulocytic ehrlichioses to make this direct test more reliable for all infective species. 

The invention claimed is:
 1. An isolated OMP-1-14 E. ewingii polypeptide consisting of a sequence that is at least 85% identical to amino acids 26 to 285 of SEQ ID NO:
 17. 2. The isolated polypeptide of claim 1, wherein the polypeptide consists of a sequence that is at least 90% identical to amino acids 26 to 285 of SEQ ID NO:
 17. 3. The isolated polypeptide of claim 1, wherein the polypeptide consists of a sequence that is at least 95% identical to amino acids 26 to 285 of SEQ ID NO:
 17. 4. The isolated polypeptide of claim 1, wherein the polypeptide consists of amino acids 26 to 285 of SEQ ID NO:
 17. 5. The isolated polypeptide of claim 1, wherein the polypeptide contains an immunoreactive fragment that is 6 or more consecutive amino acids from the following sequences: (1) SEQ ID NO: 150; (2) SEQ ID NO: 169; (3) SEQ ID NO: 187; (4) SEQ ID NO: 222; or any combination of the sequences (1)-(4).
 6. The isolated polypeptide of claim 5, wherein the immunoreactive fragment is the amino acid sequence of SEQ ID NO:
 38. 7. The isolated polypeptide of claim 4, wherein the polypeptide consists of the amino acid sequence of SEQ ID NO:
 17. 8. An immunogenic composition comprising a polypeptide according to claim 1 and a pharmaceutically acceptable carrier, wherein said composition is capable of producing antibodies specific to E. ewingii in a subject to whom the composition has been administered.
 9. The immunogenic composition of claim 8, wherein the polypeptide is operatively linked to an N-terminal or C-terminal peptide or tag.
 10. The immunogenic composition of claim 8, wherein the polypeptide consists of a sequence that is at least 90% identical to amino acids 26 to 285 of SEQ ID NO:
 17. 11. The immunogenic composition of claim 8, wherein the polypeptide consists of a sequence that is at least 95% identical to amino acids 26 to 285 of SEQ ID NO:
 17. 12. The immunogenic composition of claim 8, wherein the polypeptide consists of amino acids 26 to 285 of SEQ ID NO:
 17. 13. The immunogenic composition of claim 8, wherein the polypeptide contains an immunoreactive fragment that is 6 or more consecutive amino acids from the following sequences: (1) SEQ ID NO: 150; (2) SEQ ID NO: 169; (3) SEQ ID NO: 187; (4) SEQ ID NO: 222; or any combination of the sequences (1)-(4).
 14. The immunogenic composition of claim 13, wherein the immunogenic fragment has at least 6 amino acids that are bacterial surface exposed.
 15. A kit for detecting antibodies specific for E. ewingii, the kit comprising a polypeptide according to claim 1, and one or more reagents for detecting interactions between the polypeptide according to claim 1 and antibodies specific for E. ewingii in a test sample.
 16. The kit of claim 15, wherein the polypeptide consists of amino acids 26 to 285 of SEQ ID NO:
 17. 